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FOREWORD 


This book, which is based on a series of lectures given at the London 
School of Economics annually since 1931, aims at providing a course 
of pure mathematics developed in the directions most useful to 
students of economics. At each stage the mathematical methods 
described are used in the elucidation of problems of economic theory. 
Illustrative examples are added to all chapters and it is hoped that the 
reader, in solving them, will become familiar with the mathematical 
tools and with their applications to concrete economic problems. 
The method of treatment rules out any attempt at a systematic 
development of mathematical economic theory but the essentials of 
such a theory are to be found either in the text or in the examples. 

I hope that the book wiU be useful to readers of different types. 
The earlier chapters are intended primarily for the student with no 
mathematical equipment other than that obtained, possibly many 
years ago, from a matriculation course. Such a student may need to 
accustom himself to the application of the elementary methods 
before proceeding to the more powerful processes described in the 
later chapters. The more advanced reader may use the early 
sections for purposes of revision and pass on quickly to the later 
work. The experienced mathematical economist may find the book 
as a whole of service for reference and discover new points in some 
of the chapters. 

I have received helpful advice and criticism from many m^he- 
maticians and economists. I am particularly indebted to Professor 
A. L. Bowley and to Dr. J. Marschak and the book includes numerous 
modifications made as a result of their suggestions on reading the 
original manuscript. I am also indebted to Mr. G. J. Nash who has 
j-ead the proofs and has detected a number of slips in my construction 
of the examples. • • • 

R. G. D. ALLEN 

The London School of Economios* 

October t 1937 
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A SHORT BIBLIOGRAPHY 

It is essential to keep in mind, at all times, what mathematical analysis 
is about and the way in which it has developed over the centuries. Not 
only begiimers, but also accompUshed mathematicians, can profit by 
reading such short introductions as 

Whitehead : An Irdroduciion to Mathematics (Home University 
Library, 1911). 

Brodetsky : The Meaning of Mathematics (Benn’s Sixpenny Library, 
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good short treatments are 
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f The reader wishing to revise his knowledge of “ matriculation ** algebra 
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ranging from the elementary to the advanced in content and from the 
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THE USE OF GREEK LETTERS IN MATHEMATICAL 

ANALYSIS 


Thb English alphabet provides insufficient material for the notation of 
mathematical analysis and greater range and flexibihty are acquired by 
using letters from the Greek alphabet. There are no fixed rules governing 
the mathematical use of Greek letters but the following table indicates a 
practice which is fairly general. It should be noticed that the English 
letters given are those which correspond to the Greek letters in mathe- 
matical, not literal, usage. 


Greek 

English 

General usage 

a alpha 

a 

I 

P beta 

b 

h constants 

y gamma 

c 

j 

K kappa 

k 

1 


A lambda 

1 


fx mu 

m 


constants, parameters 

V nu 

n 


i xi 

X 

1 


rj eta 

y 

y variables 

C zeta 

z 

j 

TT pi 

V 


(special constants or variables fe.g. 77 

p rho 

r 


las the constant ratio of the circum- 

a sigma 

8 


ference to the diameter of a circle and 

^ T tau 

t 


p as a rate of interest) 

. <f> phi 

f 

\ 

0 cap. phi 

0 psi 

F 

g 

[ functional operators 

W cap. psi 

0 

1 

8 delta 

d 

1 1 

[operators indicating increments in 

A cap. delta 

D 

n 

[variables 

S cap. sigma 

8 

summation sign 

€ epsilon 

- 

a small positive constant 

6 theta ^ 


a positive fraction 


In trigonometry, a, ,8, y can denote constant angles and 8, 0 variable 

angles. 




CHAPTER I 


NUMBERS AND VARIABLES 


1.1 Introduction. 

It is conventional to divide mathematics into two separate studies, 
geometry and analysis. In geometry, the study of space and of 
spatial relationships, we investigate the nature and properties of 
curves, surfaces and other configurations of points in spacft:^ Mathe- 
matical analysis, including arithmetic and algebra, deals with 
numbers, the relationships between numbers and the operations 
performed on these relationships. The distinction between the two 
studies lies in the difference between the basic “ raw materials 
the spatial points of geometry and the numbers of analysis. 

As the mathematical technique is developed, however, we find* 
that the distinction between geometry and analysis becomes less 
clear and less relevant. The intended applications remain very 
different but the methods are seen to be abstract and essentially 
similar in nature. Mathematics involves the definition of symbols 
of various kinds and describes the operations to be performed, in 
definite and consistent ways, upon the symbols. The distinction 
between geometrical and analytical symbolism is more or less arbi-* 
trary. Further, we shall see that it is a simple matter to devise a 
method of connecting points of space and numbers, geometry and 
analysis. That such a connection is possible is seen by the use of 
graptiical methods in algebra and by the fact that elementary 
trigonometry is an application of algebraic methods to a study of 
spatial configurations. 

, The mathematical technique is abstract and must be developed 
apart from its applications. It is also essentiaSy logical* in character. 
Elementary geometry, for example, is largely an exercise in formal 
logic, the deduction of the consequences of consistently framed 

^ assumptions. development of mathematical methods, however, 
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soon necessitates the introduction of a concept foreign to formal 
logic, the concept of “ infinity ”, the infinitely large and the in- 
definitely continuous. It can be said, speaking broadly, that the 
methods of mathematics form a branch of formal logic extended in 
a particular direction to include the infinite as well as the finite. 
The only requirements of mathematics, additional to those of logical 
reasoning, are connected with the introduction of the infinite. 

The popular belief that logical and mathematical methods are in 
conflict has, therefore, no foundation. Mathematics, as a symbolised 
and extended form of logic, can only be regarded as an alternative 
to logic in a special sense. Some problems are adequately treated 
by relatively simple logical reasoning and the introduction of 
mathematical symbolism is then only destructive of clarity. But 
formal logic, with its limited range, is clearly insufficient for the 
elucidation of a vast number of problems and this provides the case 
for the development and application of mathematics. 

Though an abstract development, mathematics is not just a 
fascinating game of chess played in n-dimensional space with pieces 
of fantastic design and according to rules laid down in an arbitrary 
way. The methods of mathematics are designed primarily for actual 
or prospective application in the field of science, in the interpretation 
of phenomena as observed, abstracted and classified. The nature 
of the application must not, however, be exaggerated. The patterns 
of scientific phenomena, the laws obeyed and the imiformities dis- 
played, are not provided by mathematics. They are assumed into 
the system and mathematics can only express and interpret them, 
j help us to deduce their consequences, or to forecast what will happen 
if they hold, and tell us where to look for verification or contradiction 
of o]jr hypotheses. 

The methods of mathematics apply as soon as spatial or numerical 
attributes are associated with our phenomena, as soon as objects 
can be located by points in space and events described by properties 
capable of indication or measurement by numbers. The main object 
of 'the following development is the description of certain mathe- 
matical methods. most points, however, the nature and field of* 
application of the various methods will be noticed and discussed, 
the applicability of mathematical methods in economic theory 
receiving special consideration. 
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In concluding these introductory remarks, we can note that the 
fundamental bases of the mathematical technique have received 
much attention in recent years. It is indeed a fascinating set of 
questions that are posed in what can be described, for want of a 
better phrase, as mathematical philosophy. The nature of some of 
these questions can be observed at various stages in the following 
treatment, but the temptation to pursue them must be resisted. The 
further we probe into the fundamentals of mathematics, the more 
shaky does the whole familiar structure appear. This need not 
worry us miduly ; the crazy system works as no other system can. 

1.2 Numbers of various types. 

Numbers are usually taken as self-evident or imdefinable entities 
which do not require examination. It is easily seen, howpver, that 
the number system is far from simple and that numbers of very 
different kinds are freely used in arithmetic and algebra. It is con- 
venient, therefore, to spend a little time sorting out the various types 
of numbers and showing how they have developed side by side with, 
and by reason of, the growth of mathematical analysis from the 
simplest notions of arithmetic. Some hght is then thrown on the 
nature and applicabihty of mathematical analysis in its present 
highly developed form. 

We all begin, in arithmetic, with some idea of the^in<eyer,j, the 
natural or whole numbers, as concepts intimately connected with 
the process of counting or enumeration. It is not generally recog- 
nised that counting, and hence the integers, involves two quite 
’ distinct notions, the ordinal and the cardinal notions. The funda- 
mental property of integers is that they can be written down in 
succession without end : 

1, 2, 3, 4, 5, ... . 

The integers can thus be used to indicate any order or sequence and 
this is one of the objects of counting. We have here the ordinal 
aspect of counting and of the integers. But the integers do more 
than indicate order, for we can speak of four men, of four hats and 
of four sticks without any notion of order being necessarily involved. 
We simply mean that we have a collection of men, a collection of 
hats and a collection of sticks, and that the collections Have some- 
, thing in commryi expressed by the number four. There is, in fact, 
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a complete correspondence between the members of the three collec- 
tions ; each man, for example, may be wearing a hat and carrying 
a stick. This is the cardinal aspect of counting and of the integers, 
an aspect which enables us to say how many men, hats or sticks 
there are and which will later provide us with the basis of measure- 
ment. 

Developing the arithmetic of integers, we define the processes of 
addition and multiplication, the sum and product of two or more 
integers being themselves integers. Our first difficulty arises when 
we consider the converse process to multiplication. The division of 
one integer by another does not, in the vast majority of cases, pro- 
duce an integer. We can only say, for example, that “ three into 
seven won’t go exactly ” or that we “ put down two and carry one 
In order to introduce uniformity, the number system is extended by 
defining the fraction as a new type of number. The wider set of 
numbers, integers and fractions, is termed the system of rational 
numbers. Like the integers, the rational numbers can be written 
down in a limitless sequence of ascending magnitude, the order being 
indicated most clearly in the decimal method of writing the numbers.* 
The ordered rational numbers, however, display a property not 
shown by the ordered integers. The sequence of rationals is not 
only of limitless extent but also of limitless “ density As many 
fractions as we like can be written down to lie between any two 
numbers of the order, between \ and 1 for example. The processes 
of addition, multiplication and division can be extended to apply 
to the whole system of rational numbers. 

, Even with the rational numbers, however, it is found that the 
processes of arithmetic still lack uniformity. Consider the process 
of contracting the square root of a given number. The square roots 
of a few numbers, such as 36, 169 and 6J, are found at once as 
rational numbers. But, if rational numbers are alone admitted, we 
can only say that the square roots of most numbers, e.g. 2, 3 and 5, 
do not exist and we have again reached the unsatisfactory situation 
in Vhich we have to own that a simple arithmetic process breaks ^ 
down. For tuliformiliy, therefore, a further type of number is intro- 
duced, the irrational number. Irrational numbers include the square 

♦ A certain amount of duplication must be eliminated by writing l.i.S 

for example, in the simplest equivalent form ^ ^ 
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roots of numbers which are not “ perfect squares ” and also the 
solutions* of many equations and such numbers as those denoted 
by 77 and e which play extremely important parts in mathematical 
analysis. The definition and nature of irrational numbers will not 
be discussed here but it can be assumed that they fit into their 
appropriate places in the order of numbers according to magnitude.* 
Further, with some difficulty, we find it possible to extend the 
arithmetical processes to apply to irrational as well as to rational 
numbers. 

The next extension of the number system is useful in arithmetic 
but only becomes vital when algebra is developed. We have not yet 
referred to subtraction, the process converse to addition. When a 
smaller (rational or irrational) number is taken from a larger one, 
the result is an ordinary number of the same kind. This i.^ not true 
when the numbers are reversed. In arithmetic it is desirable, and 
in algebra essential, that the difference between any two numbers 
should be a number of the same system. This uniformity can only 
be achieved by doubling ’’ the number system so far described by 
the distinction between positive and negative numbers and by adding 
a new number zero indicating “ nothing The difference between 
any two numbers is now a number (positive, negative or zero) and 
Subtraction is a uniform process. By the adoption of further con- 
ventions, all arithmetic processes can be made to apply to positive 
and negative numbers and (with the exception of division) to the 
number zero. 

An important practical fact can be noticed in passing. The last 
extension of the number system makes possible a great simplification 
of algebraic work. In algebra we deal frequently with equations such 
as the following : ^ 

a;® -f-2a; +1 =0 ; x^ = 2x+3; 3x = 2x^+l; a;* = 5. 

We recognise these as examples of the same “ form of equation 
and (with the aid of zero and negative numbers) we can include all 
o^ them in one uniform expression. We write the equations : • 

x*+2x+l=0; a;*+(-2)a;+(-3)=:0; 23* +(^-5)5: +1 =0 ; 

+(0)a; +( - 6) = 0. 


• See 4.1 (footnote) below. 
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In the symbolism of algebra all equations of these kinds can be 
included in the general quadratic form : 

ax* +6x +c = 0 

where a, b and c stand for some definite numbers (positive, negative 
or zero). The use of symbolic letters, such as the a, b and c in the 
quadratic equation above, wiU be described more fully at a later 
stage. 


1.3 The real number system. 

The number system as now extended consists of the neutral 
number zero and of all positive and negative numbers, rational or 
irrational. This is the system of real numbers^ as commonly used in 
algebra and mathematical analysis. The rational and irrational 
numbers ^an be arranged in an order of limitless extent and density. 
The same is true of the real numbers except that the order is of 
limitless extent in two directions instead of one. AU negative 
numbers appear before the positive numbers in the order of magni- 
tude (with zero separatmg them) and we can proceed through larger 
and larger negative numbers as well as through larger and larger 
positive numbers. The real number system has also another pro- 
perty, i.e. the property that there are no ‘‘ gaps ’’ left in the order 
of the numbers. There are ‘‘ gaps in the order of the rationals but 
these are completely filled by the insertion of irrational numbers.* 

The familiar rules of arithmetic and algebra apply, therefore, to a 
system of real numbers with the properties : 

The numbers can be arranged in a definite order, the order of 
magnitude, of greater and less. 

y^The number order is of limitless extent in both directions, 
i.e. the numbers form a doubly infinite system. 

\(3) The number order is of limitless density and without “ gaps ”, 
i.e. the numbers form a perfectly continvxms system. 

The development of the real number system from the basic idea 
of a sequence of integers marks the end of a certain line of evolution 
and provides a convenient stopping place. One guiding principle 
stands out i«L •what have said. The introduction of each new 
number type wa^ designed to impose uniformity where uniformity 

* This i8*a statenjent we cannot justif}’ here. It is based on the work of 
Dedekind and Cantor ; see Hardy, Pure Mathematics (3rd Ed., 1921), pp. 1-31, 
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was previously lacking and, at the same time, to preserve all the 
essential rules of arithmetic and algebra. This is a most important 
principle which will be found at work throughout the development 
of mathematical analysis, the constant striving after generality, 
uniformity and simplicity. Herein lies much of the fascination of 
the study of mathematics. 

Even with the real number system, however, uniformity is not 
completely attained and, sooner or later, analysis reaches a stage 
where new extensions of the number system are required. The pro- 
cess of division, for example, still fails in one case since a real number 
divided by zero does not give a real number. We must turn a more 
or less blind eye to this deficiency though it will be partly side- 
tracked later when the idea of a “ limit is introduced. A more 
important lack of uniformity occurs when the solution of quadratic 
equations is considered in algebra (see 2.8 below). If real numbers 
are alone admitted, some quadratic equations have two solutions 
(distinct or coincident) and others have no solution at all. It is 
Lclearly desirable, from the point of view of uniformity, to be able 
to say that all quadratic equations have two solutions. This 
uniformity is, in fact, achieved by introducing what are termed 
‘‘ complex or imaginary ’’ numbers, a step which opens up entirely 
new fields of mathematical analysis. But we content ourselves here 
with the real number system which is sufficient for most of the 
mathematical methods with which we are concerned. 

Finally, the fact that the real number system is of limitless extent 
and density makes it only too easy to speak of infinitely large 
numbers and of numbers whose differences are infinitely small. 
Right at the beginning of our development, therefore, we find our- 
selves approaching the difficult coimtry of the “ infinite ”, of the 
infinitely large and the indefinitely small and continuous. A more 
determined expedition into this dangerous country must be left 
until a somewhat later stage ; it is sufficient here to remark on the 
need for such an expedition. 

» 

1.4 Continuous and discontinuous variables. 

♦ • 

The generalisation of arithmetic into algebra arfd of algebra into 
modem analysis is largely dependent on the extensicm of the 
symbolism. The number system remains the basfc element but its 
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uses are made more flexible by the symbolism. The first and most 
obvious distinction between algebra and analysis on the one hand 
and arithmetic on the other lies in the use of “ variable numbers, 
denoted by symbolic letters, instead of particular numbers. 

A variable number is any number, an imspecified number, from a 
certain given set of real numbers and it is always symbolised by a 
letter such bs x, y ov t. Particular numbers for which the variable 
can stand are called the values of the variable, and the whole set of 
possible values makes up the range of the variable. The use of a 
variable number necessarily implies a range of variation, a set of 
numbers from which the values of the variable can be selected at 
will. Many variables have a range including all real numbers and, 
in this case, no explicit reference to it need be made. Other variables 
have more restricted ranges, e.g. the range of positive numbers, of 
numbers between zero and unity or of integers only. Here the range 
should be clearly indicated in the definition of the variable. 

One particular case merits separate notice. If a variable x can 
take, as its values, aU real numbers lying between two given numbers 
a and 6, then its range is called the interval (a, 6). We write a <x <b 
or according as the values a and b themselves are excluded 

* or included. For example, 0<a:<l or indicate that the 

variable x has a range consisting of the interval of positive proper 
fractions, the numbers zero and unity being excluded in one case and 
included in the other. 

The notion of an interval provides a concept which later proves of 
great convenience. We often make statements about the values of 
a variable x “ near ” to, or in the ‘‘ neighbourhood ” of, a particular 
*value a. The statements are made in this vague way but their mean- 
ing can be made precise and should be borne in mind. A neighbour- 
hoo^oi the value a;=a is defined as an interval of values of x having 
the value a as its middle point. In symbols, if k denotes a given 
positive number, a neighbourhood of a; = a is defined by the interval 
(a ~ k) <x <{a ’\-k). Here, (a; - a) is numerically less than k and the 
total “ length of the interval is 2k. Though k is, in general, any 
positive nunj^er we care to fix, it is usually convenient to take k as 
small, i.e. to take*a small neighbourhood of a; =:a. 

An important distinction can now be made. A variable is con- 
tiniums if its range is either the whole set of real numbers or any 
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interval of the set. The adjective “ continuous here is of exactly 
the same connotation as the same adjective applied to the number 
system itself ; the values of a continuous variable can be ordered so 
that they are indefinitely dense and without gaps. It is often con- 
venient, for example, to think of a continuous variable as taking 
values successively in increasing order of magnitude and we describe 
this by saying that ‘‘ the variable increases continuously in value 
over such and such an interval Notice, however, that any 
changes in the value of a variable are essentially timeless ; the 
values of the variable make up a range of variation which must be 
considered as a whole, though it may be convenient to pick out or 
arrange the values in any definite way we like. 

A discontinuous variable, on the other hand, has a range which is ^ 
neither the whole set of real numbers nor any interval erf the set. 
The values of such a variable cannot be arranged in order of magni- 
tude without gaps. Any set of numbers of a particular type, e.g. the 
set of integers or of multiples of provides a range of variation 
which is discontinuous. For example, if the price of a commodity 
is X pence quoted only in halfpennies, then a: is a discontinuous 
variable taking values which are positive multiples of J. 

1.5 Quantities and their measurement. 

It is convenient, at this stage, to consider the use of our number 
system in the interpretation of scientific phenomena. One use 
springs to the mind at once, the use in the enumeration of discrete 
physical objects or events. In all such uses, whether we are counting 
(e.g.) men, road accidents or sums of money, the only numbers ^ 
required are the integers themselves. The “ unit of counting is 
here the number one corresponding to a single object or occurrence. 
For convenience, however, larger “units ” can be adopted ; we can 
count men or accidents in hundreds or thousands instead of in ones, 
sums of money in pounds, shillings and pence instead of in the 
smallest coin (e.g. halfpennies). The result of enumeration then 
appears as a fraction or a decimal instead of as a simple integer. 
Thus we can say that the number of road accidents isi55*7 thousands 
instead of specifying the number as the integer 55,700. Or we can 
say that a sum of money is £1 11s. 4|d. or £lH^ instead of^specifying 
that it contains 753 halfpennies. Hence, even in t\ie use of numbers 
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in enumeration, the range of the numbers employed can be quite 
extensive and the question of the “ unit of measurement arises as 
it does in more complicated cases. 

Enumjgration is not sufficient for the description of scientific pheno- 
mena. We derive, from our observations, what can only be caUed 
abstracted properties — shapes, colours, temperatures, pitches of 
notes, lengths, masses, time-intervals, and so on — and it is essential to 
connect these properties with numbers if possible. The two aspects 
of number, the ordinal and the cardinal, now become important. 
The ordinal aspect of number applies at once if the property con- 
sidered is capable of arrangement in some order, and this is 
the essential prerequisite for any association of numbers with the 
property. Some of the properties abstracted from scientific pheno- 
mena (e.g* temperatures and lengths) can be presented in order 
according to a criterion of greater or less, higher or lower or what- 
ever it may be ; other properties (e.g. shapes and colours) cannot be 
ordered in this way.* Hence we arrive at our first distinction. Pro- 
perties not capable of arrangement in order can be termed qualities 
and numbers are not directly applicable to them. Properties to 
.which a natural order is attached can be termed magnitudes and 
numbers can be associated with them at once. The number attached 
to a magnitude simply serves to indicate its position in its appro- 
priate order and the association is loose and by no means unique. 
The association is, however, of considerable importance in the 
scientific field, the Centigrade and Fahrenheit readings of tempera- 
ture providing a good example, and more will be said about it later 
^n. 

A second distinction can be drawn amongst magnitudes according 
as th® cardinal aspect of number can or cannot be applied in addition 
to the ordinal aspect. If a magnitude displays the additive property 
that any two of its values can be taken and added together to give 
a third value of the same magnitude, then numbers can be used in 
their cardinal aspect and the processes of arithmetic apply. Magni- 
tudes with the additive property can be called directly measuraijie 
quantises ani^fre shalltshow how numbers can measure, and not only 

* It may be possible, in the construction of physical theories, to analyse a 
quality such as colour and to connect it indirectly with numbers (e.g. by 
means of “ wave lengths **). This merely indicates that our distinctions are 
not hard and fast — as, of course, they cannot be. ^ 
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order, quantities. Lengths, masses, time-intervals and electric 
charges are examples of quantities ; temperatures and pitches of 
notes are instances of magnitudes which are not quantities. 

In order to fix ideas, let us consider the quantity length ; what 
follows applies, with suitable modifications, to other quantities such 
as mass or time-interval. The additive property of length makes it 
possible to construct a scale for measuring length. A definite length 
is chosen as standard and a number of such lengths are placed end 
to end to make the scale. This is possible since we know what is 
meant by equal lengths and since we can add lengths together to 
get other lengths. The standard length, definable in any convenient 
way, will be assumed here to be that length known as an inch. Any 
given length can be compared now with the scale of inches and 
defined as covering so many scale intervals, so many inches.® All that 
is required is the process of counting and we get lengths of 2 inches, 
10 inches, 143 inches, and so on. In other words, any given length 
is specified or measured by a number, such as 2, 10 or 143, when 
compared with the standard inch scale. 

A difficulty now presents itself. In almost aU cetses, the scale 
reading of any given length must be taken to the nearest number of 
inches. As a result, two lengths may have the same measure though 
direct comparison shows them to be unequal ; or the sum of two or 
more lengths may give a scale reading which is not the sum of the 
separate scale readings. The fundamental ordered and additive 
properties of length appear to have been lost in the process of 
measurement, a state of affairs which cannot be tolerated. The 
measurement of length must bo made to preserve the fundamental , 
properties. 

The first step is to admit fractional as well as integral numbers in 
our measure of length. We can subdivide the scale unit by taking 
(e.g.) tenths or hundredths of an inch and then compare the given 
length with the subdivided scale. Alternatively, we can apply a 
given length repeatedly to the fixed inch scale until the multiple 
length coincides with a scale reading. Thus, the length of 2| or £'75 
inches is such that it covers 276 of the hundredth inch Jinits, or such 
that a hundred such lengths together cover 276 of the inch units. 

In practice, this process will suffice since there must be some 
physical limit to the comparisons that can be made between a 
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given length and any actual inch scale. But the measurement of 
lengths by means of rational numbers fails in our theoretical require- 
ments of length. Pythagoras’ well-known theorem, for example, 
requires that the diagonal of a square of one inch side should be 
represented by inches, and is not a rational number. We 
must, therefore, conceive of lengths measured by irrational numbers, 
even though actual measurement with a scale of inches does not 
support us. 

What has happened now is that we have associated the concept 
of the infinitely small and continuous with the observable property 
of length and, like the abstract number system itself, we assume that 
length is continuously variable. In other words, a new and abstract 
concept of length is introduced ; we assume that there is a “ true ” 
length wWch is continuous but incapable of actual measurement. 
This assumption of continuity, which must be recognised as some- 
thing outside the sphere of actual observation, is imposed upon 
length for purposes of mathematical convenience, in order that the 
measure of length should display the additive property and obey 
the rules of arithmetic exactly as we desire. The assumption of 
, continuity into the phenomena we study is of frequent occurrence 
and the present instance is only the first of many. The only differ- 
ence between this and some later instances is that we have grown 
used to the assumption in this case, but not in others, so that we 
think it “ reasonable ” here but not there. 


In the description of scientific phenomena, therefore, there appear 
certain directly measurable quantities. Only three of these quan- 
^tities are required in mechanics, length for the description of space, 
mass for the description of weight or inertia, and time-interval for* 
the 46Scription of temporal changes. Other fundamental quantities 
must be added as the range of physics is widened. In electricity, 
for example, we need the new fundamental quantity of electric 
charge to describe the phenomena then included. Having selected a 
standard scale unit, we can express each quantity as a variable 
nuftiber. The recognised units of theoretical physics for measuring 
length, mas» tind thjie are the centimetre, gramme and second 
respectively. Other units are possible and are often used in more 


mundane descriptions ; the xmits of a foot and a pound can replace 
for example, the standard centimetre and gramme units. 
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Finally, the property of continuity is imposed upon the measure 
of each of the fundamental quantities. When we speak of a length 
of X centimetres, a mass of y grammes or a time-interval of t seconds, 
the numbers a;, y and t are assumed to be continuous variables subject 
to the rules of arithmetic and algebra. This seems reasonable 
enough, but the remarks above serve to indicate, though inade- 
quately, the serious abstractions and assumptions that lie behind 
bald statements about the measurement of fundamental quantities. 

1.6 Units of measurement. 

The measurement of a quantity, as defined above, is only unique 
if the standard scale unit is given. The choice of the standard unit 
of each quantity is open and can be made in a large number of ways. 
As we have noted, the scale unit of length adopted in theoretical 
work is almost invariably the centimetre. But inches, feet, yards, 
miles, millimetres, kilometres and rods or poles provide other recog- 
nised examples of length units. It appears, therefore, that there is a 
large element of arbitrariness in the measure of any quantity and 
the question arises whether there is any real distinction between 
quantities, with measures arbitrarily dependent on the choice of 
scale imit, and magnitudes in general, indicated by numbers without 
any pretence at uniqueness. It will bo shown that the distinction is 
real, that the arbitrary element in the measure of a quantity is much 
more limited and easily dealt with than in the indication of non- 
measurable magnitudes. 

There exists a very simple rule connecting the measures of a given 
quantity on two different scales. Let the second scale unit be 
measured and found to contain A of the first scale imits. If a 
quantity has measures x and y in the first and second scale qpits 
respectively, then we have a;=Ay, a relation which holds for anyone 
quantity whatever of the kind considered. 

Hence, the numerical measure of a given quantity changes when 
the unit of measurement is changed, but the change in the measure 
IS < 5 ^uite simple and follows the familiar rule of proportions. The 
proportion is fixed by A, the measure of one unit^n terms 3 f the other- 
For example, a length can be expressed either as 'x feet or as 12a; 
inches, whatever the value of x. The length measure' changes 
according to the fixed proportionality rule independently of the 
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length we happen to take. The connection between units may 
be expressed in fractional, or even irrational, form. One inch is 
measured as 2*54... on the centimetre scale and a length of x inches 
is thus (2*54...)a; centimetres. 

The measures of a series of quantities of the same kind on one 
scale are exactly proportional to the measures on a second scale. 
For example, the same set of lengths can be measured by 
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The limitation on alternative measmes can be expressed by saying 
^ that the ratio of the measures of two quantities of the same kind is 
iridependent of the units chosen. The ratio of the measures of the 
first two lengths instanced above is always \ ; the first length is half 
1 the second, a statement independent of scales. 

The arbitrary element in the indication of a magnitude which is 
not a directly measurable quantity is subject to no such limitation. 
A set of magnitudes of the same (non-measurable) kind arranged in 
ascending order can be indicated by any rising set of numbers 
whatever i 

1, 2, 3, 4, 5, ... ; 

1, 3, 5, 7, 9, ... \ 

1, 4, 9, 16, 25, ... , and so on. 

There is no connection between the various numbers that can 
indicate any one magnitude ; the question of changing units does 
not arise. 

. l.T^Derived quantities. 

From the limited number of fundamental quantities necessary 
I in the description of scientific phenomena can be obtained a large 
. number of secondary or derived quantities. The two most frequent 
S ways of defining derived quantities are by the multiplication and 
\ division of fundamental quantities of the same or dififerent kinds. 
* It is sufficiferit to qirote, as examples of derived quantities defined 
as products, the cases of superficial area and of cubic volume. Area 
is obtained as th^ product of two, and volume as the product of three, 
length measures. Probably more important, and certainly more 
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frequent, are the derived quantities obtained by dividing one funda- 
mental quantity by another. The quotient of distance travelled by ? 
an object divided by the time of transit gives the derived quantity 
known as speed. If the mass of a body is divided by its volume, we . 
obtain the derived quantity of density. The quantity of electric 
charge passing a given point in a wire divided by the time it takes to 
pass provides the derived quantity of electric current. Fm*ther, to 
quote a simple example from the economic sphere, a sum of money 
divided by the number of units of a commodity bought with the 
sum gives the price of the commodity. 

From their definition it follows that derived quantities can be 
measured, but only in terms of two or more standard units. A scale 
unit for each fundamental quantity used in defining a derived 
quantity must be chosen before the latter can be expressed in 
numerical terms. This is made clear by the way in which derived 
quantities are commonly specified ; speeds are indicated by measures 
such as 30 miles per hour or 25 feet per second, densities by so many 
pounds per cubic inch and prices by so many pence per pound. The 
measures of derived quantities, like those of the fundamental quan- 
tities, are numbers which are taken as continuously variable. 

It will be seen later that derived quantities include two distinct ^ 
types, the “ average ’’ type and the “ marginal type. Average 
speed illustrates the first type ; if a train travels 45 miles in an hour 
and a half, then we say that its speed, on the average, is 30 miles per 
hour. The second type is illustrated by velocity or instantaneous 
speed ; if the velocity of a train is 30 miles per hour at any moment, 
this implies that the distance travelled in a very short time-interval ^ 
from the given moment, divided by the time-interval, is approxi- 
mately equivalent to 30 miles in an hour. The price of a commo^Jity, 
as usually defined, is an average price. The corresponding marginal 
concept can be defined, however, and given the name ‘‘ marginal 
revenue a concept of considerable importance in economic theory. 
Both corresponding types of derived quantities are measured in the 
sanje way and in terms of the same pair of scale units. ^ ^ 

The various scales necessary for the measure, pf a derived quantity 
are at choice and, if any one unit is changed, then the numerical 
measure of the derived quantity is changed. But the tjhange is 
governed by a rule of proportions similar to that already described. 

1 
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Suppose, for example, that a derived quantity is obtained by the 
division of one quantity A by another quantity B, The units of 
measurement are changed so that one unit of the second scale of A 
equals units of the first scale, and one unit of the second scale 
of B equals A^ units of the first scale. A given value of the derived 
quantity is measured by x on the first scales and by y on the second 
scales. Then we have 


a; = 


Af, 


2 /- 


For example, the average speed of a train is 30 miles per hour. Now, 


30 miles in an hour implies 30 x 5280 feet in an hour, and so 


30 X 5280 
3600 


feet in a second. The speed is thus 44 feet per second. In general, 
a speed f)f x feet per second corresponds to y miles per hour if 


5280 2 

— 3600 y — 16 y* 


Again, if the average price of sugar is 3d. per pound, what is the 
price in shillings per cwt.? A cwt. or 112 pounds costs 33Gd. or 
28 shillings. The requii'ed price is 28s. per cwt. In general, a price 
of 0 C&, per cwt. corresponds to yd. per pound if 

X— 12 y— 3 y- 

So, one measure of a derived quantity is a constant multiple of an 
alternative measure obtained by changing the units of measurement. 
When the derived quantity is the quotient of two quantities, as 
often happens, the multiple is simply the ratio of the scale multiples 
of the separate quantities. 


1.8 The location of points in space. 

Our main purpose is to develop the methods of mathematical 
analysis. Any introduction of geometrical or spatial considerations, 
therefore, will be subsidiary to, and illustrative of, the analytical 
development. It will not be necessary to enlarge upon the funda- 
mental geometrical problems concerned with space ass an •abstract 
conception and defined in any consistent way we please. We awoid 
difficulties using mnlj the space of our senses, taken as possessing 
' the properties of being infinite, continuous and metrical. Primary 
sensations give jis a conception of space and spatial configurations 
and tell us that we can assume that distance between points of space^ 
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is measurable (in definite units) and continuously variable. Further, 
space can be regarded as of limitless extent in one, two or three 
dimensions as the case may be. 

The geometrical considerations given here can, therefore, be illus- 
trated by drawing actual diagrams in the plane of the paper (two 
dimensions) or by sketching plane representations of spatial models 
(three dimensions). It is possible, by giving up visual representations, 
to extend our geometry to spaces of more than three dimensions 
with the same properties as those given above. But this would 
require a precise re-definition of aU the geometrical concepts which 
we are taking as self-evident, a task which it is scarcely appropriate 
to imdertake here. Occasionally, however, we shall proceed by 
analogy and refer to n-dimensional “ distances ‘‘ flats ’’ and 
‘‘ hypersurfaces ” as the analogues of three-dimensional distances, 
planes and surfaces. It must be remembered that such geometrical 
terms are introduced only as illustrative of the analytical methods. 

The spatial properties we have assumed here are clearly analogous 
to the properties of the real number system. This analogy makes it 
possible to locate points in our metrical space by means of numbers, 
the method being derived from the way in which we measure 
distances in space and depending on the number of dimensions of 
the space. 

Space of one dimension consists of a continuous and indefinitely 
extended straight line which we can denote by L. A practical 
method of fixing the position of a point on a line is to state the 
distance, in definite units, of the point from a given base point : the 
position of a train on a railway track, for example, can be defined 
in this way. We can easily express this method in a precise form 
suitable for the line representing our one-dimensional space. A con- 
venient point 0 is selected on L and a definite scale of measurement, 
e.g. one unit = J inch, is taken. Taking for convenience of wording 
the line L as ** horizontal ”, a convention is adopted whereby all 
distances measured from 0 to the right are positive and all distances 
to the left negative. Then each point on the line L is located without 
ambiguity by a number, the number of units (e.g. \ inches) of the 
dlistance of the pomt from O. The number^ can ba positive (if the 
point is to the right of 0) or negative (if the point is to the left of 0), 
So, the point located by the number -h3 is 3 units of distance to the 
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right of 0 and the point locfikted by ~ J is a half unit of distance to 
the left of 0. Since the number system and the points on L are con- 
tinuous^ each point on L is represented by one definite number and 
each number corresponds to one definite point on L. The method 
depends entirely on the fixing of 0 and of a definite scale ; if a 
different base and scale are chosen, an entirely different correspond- 
ence, still unique within itself, is set up. 

The next step is to locate points in two dimensions, i.e. in a 
plane which can be visually represented by the plane of the paper. 
Experience again provides a practical indication of the appropriate 
method. A position in a flat coimtry is conveniently located, as on 
an Ordnance map, by giving the number of miles the position is 
north or south of a given base point, and the number of miles it is 
east or west of the same point. The essential fact is that two 
’ numbers or measurements are required. To make the method pre- 
cise, fix two straight lines Li and in the plane which cut at right 
angles in a point 0. In the visual representation, and for convenience 
of wording, the first line can be taken as horizontal and the second 
line Lg vertical, the positive direction of being from left to right 
and of Lg upwards. A definite unit is chosen which serves for 
measuring distances along both lines. From a point P in the plane, 
‘perpendiculars PM and PN are drawn to the lines and La- The 
point P is then located without ambiguity by two numbers, the first 
being the number of units in the distance OM and the second the 
number of units in the distance ON. In Fig. 1, P is located by the 
number pair (4, 3), Q by (3, 4), B by (|, - 2) and aS by ( - 4, - f ). 

Of the two numbers which locate the position of a point P, the first 
‘ is the distance of P firom 0 parallel to L„ i.e. to the right or loft of 
La ; if this number is positive, P is to the right of and if negative, 

P is to the left of La* The second number is the distance of P from 0 
parallel to Lj, i.e. the height of P above or below Lj ; P is above Lj 
if the number is positive and below Lj if the number is negative. 
The lines L^ and La divide the whole plane into four “ quadrants 
In the N.E. quadrant both numbers are positive, in the N.W. 
quadrant the first nmnber is negative and the second positive, in the 
S.W. quadrant both numbers are negative, and in the S.E. quadrant 
the first number is po&tive and the second negative. The order of 
writing thqi numbers is important ; an interchange of the numbers 
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(if unequal) alters the position of the point (see P and Q of Fig. 1). 
From the continuity of the number system and of the plane, each 
point of the plane is represented by a definite pair of numbers and 
each pair of numbers by a definite point' of the plane. Relative to 
the fixing of and and of the scale of measurement, the corre- 
spondence between number pairs and points is quite unambiguous. 

Suppose a table is given showing corresponding values of two 
variable quantities. Taking squared paper and ruling two perpen- 
dicular lines along which values of the two quantities are respectively 
measured, each pair of values can be plotted, by the above method, 
as a point on the squared paper. The result is a graphs showing a 



Scale: One unit « }i Inch 
Fio. 1. 

group of plotted points, from which the relative changes in the 
quantities can be traced. It is not necessary here to take the same 
scale of measurement for each quantity ; the units in which 4he 
quantities are measured are more or less arbitrary and each scale 
can be chosen for convenience of plotting. Examples I provide 
practical instances of graphical representations of this kind. 

The location of points in space of three dimensions can be carried 
out in an exactly similar way. Three straight lines Lz and are 
fixed to intersect at right angles in a point 0, With defimte positive 
directions along the lines and a definite scale of measurement for 
all distances, the position of a point P is located ii> space as follows. 

*A rectangular ^‘box ’’ is constructed with three adjacent sides OM, 
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ON and OL along the lines and J &3 respectively, and with P 

as the vertex opposite to 0. The position of P is then located by the 
set of three numbers which represent respectively the number of 
units in the distances OM, ON and OL, due regard being paid to sign. 
Remarks similar to those given in the case of two dimensions apply 
to this three-dimensional method of locating points. 

1.9 Variable points and their co-ordinates. 

The importance of the methods of locating points in space is seen 
when they are applied, with suitable modifications of terminology 
and notation, to connect variable points and variable numbers. 
A variable point in space is a point which can take any position from 
a given range of possible positions. As a point varies in position, so 
the numbers which serve to locate the point vary in value. From 

this simple fact, we can construct 
a most powerful theoretical tool 
applying to both analysis and 
geometry. 

We can start with the two- 
dimensional case and, since we 
are considering variable points 
in general, we can give up any 
attempt at scale drawings and 
rely upon geometrical reasoning 
illustrated by simple diagrams. 

Fig. 2, Denote two given perpendicular 

lines by Ox and Oy, drawn with 
positive directions as shown in Fig. 2 and with the line Ox taken 
horizontally and the line Oy vertically. The point 0, in which 
the lines intersect, is called the origin of co-ordinates and 
the lines Ox and Oy are called the co-ordinate axes or the axes of 
reference. Any point P is taken in the plane Oxy and perpendiculars ^ 
PM and PN are drawn to the axes. With a definite scale of measure- 
ment for all distances, let OM contain x units and ON y units of 
distance. Ihe point*P is fixed by the number pair (a:, y), where the 
numbers must be written in this definite order. The point P is said 
to have co-ordinates (cc, y), the first being the x-co-ordinate and the 
second the y-co-ordinate. Alternatively f the co-otdinate along the"" 
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horizontal axis {Ox here) is called the abscissa and the co-ordinate 
along the vertical axis is called the ordinate of P. 

Once the axes and the scale of measurement are fixed, all points 
are located uniquely by their co-ordinates. As the point P varies 
in the plane, so the co-ordinates become variable numbers with 
certain definite ranges of variation. If P varies over the whole plane, 
then the variables x and y each have the whole set of real numbers 
as their range of variation. If P varies in a more restricted way, 
then the variables x and y have ranges which may be continuous or 
discontinuous and related in all kinds of ways. To a variable point 
in two dimensions, therefore, there corresponds a pair of variable 
numbers ; much use of this will be made later. 

There are several ways of interpreting the co-ordinates of a point. 
The point (a;, y) can be regarded as distant x units perpendicularly 
from the y-axis and y units perpendicularly from the a:-axis. In 
other words, the point is a distance x units to the right or left of the 
vertical base line and a height y units above or below the horizontal 
base line. From another point of view, the point {x, y) is reached 
by going a distance x units from O along Ox and then a distance 
y units parallel to Oy, or conversely. 

The co-ordinates of particular points should be noticed. The 
origin 0 is itself a point in the plane and its co-ordinates are seen to 
be (0, 0), Any point on Ox must have zero for its y-co-ordinate and 
it appears as (x, 0), where the (positive or negative) distance of the 
point from 0 is x units. In effect, therefore, a point on the line 
which is taken as the a;-axis need have only one co-ordinate, and a 
variable point on this line is represented by one variable number x. 
Similar remarks apply to a point with co-ordinates (0, y) lying on 
the y-axis. Finally, points in the various quadrants of the pla^e 
have co-ordinates whose signs are determined by the rules already 
given (1.8 above). 

/ The extension of the representation of points by co-ordinates to 
the case of three-dimensional space presents no essential difficulty. 
Threft perpendicular co-ordinate axes Ox, Oy and Oz must now bfe 
fixed, intersecting in the origin 0 and with positive flifections as 
shown in Fig. 3. For convenience, the plane Oxy can be taken as 
horizontal, the axis Oz then being vertical. Dropping perpeildiculars 
from any point P«in space ^to the planes Oxy, Oyz and Ozx (called 
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the co-ordinate planes) and completing the rectangular box '' with 
sides OJf, ON and OL along the axes, let OM contain x units, ON 
y units and OL z units of distance. Then the position of the point P 
is fixed by three co-ordinates (x, y, z) written in this definite order. 
For fixed co-ordinate axes and a fixed scale of distance measure- 
ment, the variation of a point in space is described by three variable 
numbers, the variable co-ordinates of the point. Two of the variables, 
X and y, determine points in the horizontal plane Oxy while the third 
variable z determines the height of the variable point in space above 
or below the corresponding point in the horizontal plane. 



There is no need, of course, to adopt only the point of view 
taken so far in which variable points are represented by variable 
co-ordinates. Since the connection between points and co-ordinates 
is quite unambiguous (relative to fixed axes and a fixed scale of 
measurement), we can equally well start with variable numbers and 
represent them by a variable point in space. For example, given 
two variables x and y related in any definite way, we can fix two 
axes Ox and Oy in a plane together with a scale of measurement and 
so obtain a corresponding variable point P (x, y) in two-dimensional 
space. All this is independent of the sjnnbols adopted for the 
variables concerned ; to a pair of variables denoted by p and g, 
there corr&ponjis cf variable point (p, g) in a plane referred to two 
perpendicular axes Op and Oq, 

The perfect two-way ’’ connection we have described between 
variable points and variable numbers qlearly provides the Unk wo 



NUMBERS AND VARIABLES 


require between geometry and analysis. This link, as we shall see, 
enables us to translate an analytical problem into a geometrical or 
diagrammatic one, and conversely. We have, in fact, established 
one of the most useful methods of the mathematical technique. 


EXAMPLES I 

The measurement of quantities • 

1. Explain how a scale of measurement for mass, considered as a funda- 
mental quantity, can be constructed. How is a given mass compared with 
the scale ? Must continuity be imposed upon the measure of mass ? Consider 
the other fundamental quantity, time-interval, on the same lines. 

2. An isosceles triangle has two equal sides of one inch enclosing an angle 
of variable size. Show how the length measure of the third side can give a 
scale for measuring angles of less than two right angles, thus deriving an 
angular measure from length measure. What is the measure, on this.system, 
of angles of 60® and 90® f 

3. Assuming that lengths can be measured along the circumference of a 
circle, obtain an alternative length measure of angles, showing that angles up 
to four right angles can now bo measured. How can this be extended to angles 
of any size? What is the relation of this length measure to the familiar 
measure in degrees ? 

4. Illustrate the change of measure of a quantity by expressing 321 inches 
in feet ; 1*63 yards in inches ; 26^ cwts. in pounds ; 4897 pounds in tons and 
0*285 hours in seconds. 

6. Given the connection between a foot and a centimetre, find an approxi- 
mate measure of a centimetre in inches. Express 6 inches in centimetres, 
3i yards in metres and 1 mile in kilometres. 

6. From the measure of a pound in grammes, find the approximate 
measure of a gramme in ounces. Express 2 ounces in grammes and 12 pounds 
in kilogrammes. 

7. Express 6072 square inches in square yards and 0 038 cubic feet in 

cubic inches. One acre is 4840 square yards ; how many acres are there in a 
square mile T ^ 

8. Express a speed of 20 feet per second in miles per hour. What is the 
measure of a speed of 60 miles per hour in feet per second and in metres per 
second ? 

9. A unif orm body weighs 6 cwts. and has a volume of 6 cubic feet. Find 

the (Jensity in pounds per cubic foot and in oimces per cubic inch. ^ 

10. A cube of imiform substance has a side of 2 inches and weighs 3 pounds. 
Find the density in pounds per cubic inch and in gr?,mm^s per cubic centi- 
metre. 

* In these examples, take 1 foot =30*48 centimetres and 1 pound =4636* 
•grammes. • 
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11. For a particle of mass m and velocity v, we define 

Momentum =m,v and Kinetic Energy = . v*. 

How are the measures of momentum and kinetic energy expressed in term 
of length, mass and time units f 

12. A body of mass 2 poimds is moving at a uniform speed of 30 feet per 
second. Find its momentvim and kinetic energy in pound-feet-second units 
and also in centimetre-gramme-second units. 

13. Illustrate the fact that the derivation of secondary quantities can be 
carried beyond the first stage by expressing acceleration (the change of velocity 
over time) in distance and time units. The acceleration due to gravity is 
approximately 32 in feet-second units ; what is its measure in centimetres 
and seconds ? 

14. A retail tobacconist buys a brand of tobacco at £6 9s. lOd. per 10 pound 
bag wholesale. Adding 16% for his expenses and profit, what should be the 
retail price per poimd (to the nearest penny) and per oimce (to the nearest 
halfpenny) ? 

16. The number of acres of land used in wheat production , the amount of 
wheat obtained in bushels and the total cost of production are all known. 
Explain the derived nature of the concepts of average product per acre and 
of average cost per bushel of wheat. 


16. From the data given in the table : 


J^nd in 

Area 

(acres) 

Wheat production 
(bushels) 

Total cost 
<£’8) 

Great Britain 

12 

467 

97 

Canada 

186 


326 

France 

67 


233 


find the average product per acre and the average cost per bushel of wheat 
in each case. How can these derived figures be used to compare wheat 
production in the different regions named ? 

17. Assuming that a bushel of British wheat, produced under the condi- 
tions of the previous example, is sufficient for 60 pounds of bread, calculate 
the cost of production of the wheat content of a two pound loaf. The wheat 
pr«^ucer makes a profit of 7J% and the loaf sells at 4d. ; what proportion of 
the selling price goes in baking and distribution 7 

18. A small chocolate firm employs 16 hands at an average wage of 46s. 
per week ; overheads and materials cost £41 6s. per week and the output of 
the firm is 34 tons of chocolate per year. Find the average product in poimds 
per hand per week and the average cost of a J pound block of chocolate in 
pQUoe. 

19. Show 4hat the rgte of simple interest is a number expressed only in 
time units, varying inversely with time. How is the rate of interest per year 
reduced to a rate of interest per week? What is the amount of £100 at 
r% simple interest per year at the end of n years? 

20. A man pawns his watch for 18s. and is^ charged per week by way* 
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of interest. Calculate the rate of (simple) interest per week that is implied by 
this charge. If 12% per year is taken as a standard rate of interest for this 
type of loan, is the pawnbroker extortionate in this case 7 

Graphical methods 

21. Selecting suitable rectangular axes in a plane, plot the points with co- 
ordinates (4, - 1) ; ( -i 3) ; (2, J) ; (2-3, - 1-7) and ( -2-8, 0). 

22. Show that the three points whose co-ordinates referred to rectangular 
axes in a plane are A (6, 7), B (9, 3) and (7 ( - 2, - 4) form an isosceles triangle 
ABC, Which are the equal sides? By direct measurement, determine 
whether the other side is longer or shorter. 

23. By plotting a graph, show that the points with co-ordinates ( - 3, - 1) ; 
(6, 2) ; (1, -3) ; ( -3, 6) ; (6, -1) all lie on a certain circle. Locate the 
centre and measure the radius of the circle. 

24. Four points O, P, Q and R are marked in a plane country. OR is a line 
in a W.E. direction and P is 12 miles from 0. The position of P is 3 miles 
E. and 4 miles N. of O, and that of Q is 4 miles W. and 6 miles N. of R. What 
axes and scales are suitable for locating the points graphically? Find the co- 
ordinates of the points in the system you choose, plot the points on graph 
paper and measure the distances OP, PQ and QR, Estimate the length of the 
shortest route from 0 to R calling at P and Q and check your result by using 
Pythagoras* Theorem. 

25. Illustrate the arbitrary element in the location of points in a plane by 
rectangular co-ordinates by considering the following. 

Three points A, B and O lie in a plane in which lines Li and Lx (Lx hori- 
zontal and Lx vertical) are fixed. A is 3 inches above Li and 2 inches to the 
left ot Lx; P is 6 inches above Lx and 10 inches to the right of L, ; O is 1 inch 
below Lx and 8 inches to the right of Lx, Find the co-ordinates of the points 
referred to Lx and L, as axes when the scale unit is equal to (a) one inch, 
(6) one-tenth of an inch, and (c) one foot. 

A point O' is now taken 2 inches below Lx and 1 inch to the right of L,. 
Straight lines Lx' and L,' are drawn through O' parallel to Lx and L, respec- 
tively. Find the co-ordinates of the three points on each of the above scales 
when Lx' and L,' are axes. 

26. Three mutually perpendicular axes L, and L, are fixed to intersect 
in a point 0 in space (L, being vertical). Find the co-ordinates of the point A 
which lies inches from O on L, ; of the point B in the plane of Lx and L, 
which is 6 J inches from 0 parallel to Lx and 2 inches parallel to L, ; and of the 
points G and D which are 4 inches above and below B respectively. Dra’^ 
a rough diagram to illustrate. 

27. Choosing convenient scales, plot a graph of the pairs of values of the 
variables x and y given below : 


X 

• 

1 

2 

3 

D 

5 

6 

D 

8 

9 

10 

y 

108 

81 

63 

60 

40 

32 

26 

mm 

HI 


13 


Draw a freehand curve through the plotted points. Is there any evidence of 
a “ law ** connecting changes in x and t/? Find an approidmate value of y 
Vh^^l^c = 3•6, \ 
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28. The following pairs of values of x and y are given ; 


x 

-9 

-7 

-6 

-3 

-2 

0 

1 

3 

4 

6 

8 

9 

y 

100 

9*4 

7-7 

60 

3 4 

0 

-1-7 

-60 

-6*4 

-8*7 

-9*8 

-100 


Plot the corresponding points on graph paper, draw a smooth curve through 
the points and find the values of y when a; is - 6 and 6. What can be deduced 
about the variation of y when x varies? 


29. Show, by a graphical method, how the pressure on a piston of a steam 
engine varies at different positions of the stroke when the following facts are 
given : 


Position of piston (inches from 
beginning of stroke) 

0 

1 

4 

6 

8 

11 

15 

19 

20 

Pressure (000 lbs.) - - - 

360 

36-6 

36*5 

33-5 

26-6 

17-5 

13-7 

90 

1*85 


30. The following table (data from Schultz : Statistical Laws of Demand 
and Supply) gives p, the yearly average N.Y. wholesale price of sugar, and g, 
the yearly U.S.A. consumption of sugar, for the period from 1900 to 1914. 


Year 

p (cents 
per pound) 

g (Mn. 
short tons) 

Year 

p (cents 
per pound) 

g (Mn. 
short tons) 

1900 

6-32 

2-49 

1908 

4-96 

3-57 

1901 

505 

2-66 

1909 

4-77 

3-66 

1902 

4-46 

2-87 

1910 

4-97 

3-75 

1903 

4-64 

2-86 

1911 

6-34 

3-75 

1904 

4-77 

310 

1912 

504 

3-93 

1905 

5-26 

2-95 

1913 

4-28 

419 

1906 

4-51 

3-21 

1914 

4*68 

4-21 

1907 

4-66 

3*35 





Plot a graph, one point for each year, to show the related variations of p and g, 
(Such a graph is called a “ scatter diagram ”.) Is there any evidence that a 
rise in consumption accompanies a fall in price 7 

31. Points in a plane can be located by means of co-ordinates with reference 
to axes which are not perpendicular (oblique co-ordinates). If lines Li and 
imersecting in 0 at any angle, are taken as axes, the co-ordinates of a point P 
are {x, y), where P is reached from 0 by going a distance x units along Li and 
then a distance y units parallel to L,. Illustrate with a diagram. 

32. The axes are at an angle of 45° ; plot the points whose co-ordinates are 
given by the following pairs of numbers : 

(2,2); (0,3); (-4,0) and (-3,-2). 

(Scale : one unit =one inch.) 

33. A parallelqgraiA has a horizontal side OP of 4 inches and a side OR of 

3 inches at an angle of 60° to OP. Draw the parallelogram on graph paper 
and fmd»the co-ordinates of the comers and of the point of intersection of the 
diagonals when the axes are taken as (a) OP and OR, and (6) horizontal and 
vexrtical lines at 0. « , 
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34. An entirely different method of locating points in space is by means of 
polar co-ordinates. A point O and a direction Ox (taken horizontally) are fixed 
in a plane. The polar co-ordinates of a point F are defined as (r, d)y where the 
distance of P from 0 is r units and where OP makes an angle of 6°, in the 
counterclockwise direction, with Ox. Illustrate with a diagram and by 
plotting a graph showing the points whose polar co-ordinates are 

(3,60"); (1.300"); (f, 0") and (|, 135"). 

35. Referred to rectangular axes, four points have co-ordinates 

(1,1); (-2,-2); (3,0); (0,-2). 

Fixing the point O and the direction Ox, find the polar co-ordinates of the 
points. Illustrate by plotting the points on graph paper. 

36. An observer, standing at a position 0 in a plane country, locates four 
landmarks A, B, G and D. A is 10 miles away in a N.N.E. direction ; D is 
due S. and at a distance of 7 miles ; G is W.N.W. and 8 miles away and D is 
distant 15 miles in a S.E. direction. Plot a graph showing the four landmarks 
and find the polar co-ordinates of the plotted points referred to O and the E. 
direction through O. (Scale ; 5 miles =one inch.) 



CHAPTER n 


FUNCTIONS AND THEIR DIAGRAIilMATIC 
REPRESENTATION 

2.1 Definition and examples of functions. 

In mathematical analysis we are concerned with variable numbers. 
The important thing about variables is not the way in which each 
varies hj itself, but the way in which different variables are related 
one to another. Mathematical analysis, in short, is the study of 
relationships between variable numbers. A simple technical term 
is used to describe and symbolise a relationship between variables, 
the term function 

The notion of a function is an abstract and general one but 
essentially very simple. In order to emphasise the simplicity of the 
notion, we can conveniently introduce at once a few examples of 
functions as applied in scientific work and everyday life. In applying 
mathematical analysis to actual phenomena, the physical and other 
quantities of the phenomena are expressed by variable numbers. 
Any observed or assumed relation between the quantities then 
corresponds to a functional relation between the variables. The 
following examples make this clear. 

If a lead peUet is dropped from a height, we know that the distance 
it Jjas fallen (in feet or some other unit) depends on the time-interval 
(in seconds or some similar unit) since the moment of dropping. 
There is a relation between distance and time and the distance 
fallen is a function ’’ of the time-interval. Similarly, if a gas is 
subjected to pressure at a constant temperature, its volume varies 
with, i.e. is a “ function ” of, the pressure. To take a more everyday 
example, the' amount paid in inland postage on a parcel (in some 
monetary units) depends on, and is a “ function ” of, the weight of 
the parcel (in pomida or a similar unit). Finally, in economics, the 
amount of a certain commodity demanded by % given market 
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taken as connected in some way with the market price of the com- 
modity, i.e. demand is a “ function ’’ of price. 

In each of these examples, there are two variable quantities repre- 
sented, in specific units, by variable numbers. The quantities, and 
the associated variables, do not change independently of each other ; 
there is a connection between corresponding values, a dependence 
of one quantity upon the other. The idea of a function, therefore, 
involves the concepts of a relation between the values of two vari- 
ables and the dependence of one variable on the other. 

Formal definitions of two technical terms can now be given. 
There is an implicit function or functional relation between two 
variables x and y, with given ranges of variation, if the values that x 
and y can take are not independent of each other but connected in 
some definite way. If the value of a; is known, then the value or 
values that y can take are fixed and not arbitrary, and similarly 
conversely. An implicit function is thus a mutual relation between 
two variables and either variable “ determines ” the other. The 
variable y is an explicit function of the variable x if the value or 
values of y depend in some definite way upon the value which is 
allotted arbitrarily to x. In this case, it is the variable x which 

determines ’’ the variable y. In the same way, x may be given as 
an explicit function of y. 

It follows at once that a given imphcit function between x and y 
fixes two explicit functions — y as an exphcit function of x and x as 
an explicit function of y. These two explicit functions are said to be 
inverse to each other. Conversely, each given explicit function must 
arise from some implicit fiuiction and has its corresponding inverse 
function. The difference between implicit and explicit functions is 
mainly one of point of view or emphasis. If the relation betweei^a; 
and y is regarded as mutual, then the term implicit function is 
appropriate ; the variables are on an equal footing. If the relation 
is regarded from a definite angle, e.g. y as depending on a:, then we 
use the term explicit function. Here, the variables are arbitrarily 
separated and y is called the dependent and x the independent variable. 
It is to be noticed, however, that a function is fiot a causal relation 
even from this latter point of view. There is no discrimination 
between the variables except for convenience, and one variable does 
not “ cause ** th<f other. Causal relations occur only between the 
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quantities of actual phenomena and, when such a relation is inter- 
preted by a fimction, it merely happens that one view of the fimction 
is dominant and the other neglected. Every mathematical function 
can be regarded from either point of view ; every function has its 
inverse. 

The concept of a function is an extremely wide one. The follow- 
ing examples serve to show this and will, also, help us to classify 
functions in a convenient way for detailed consideration. 

Ex. 1. The variables x and y take any numerical values which are 
such that the value of y is double that of x. In symbols 

is the expression of this implicit function. The two explicit and inverse 
functions are 

y=^2x and x = \y. 

The range of variation of x and y consists of all real numbers. 

Ex. 2. The variable y depends on the variable Xy which can take any 
numerical value whatever, according to the algebraic processes indicated 
by the symbolic expression 

y^x^^Zx-2. 

Here we have y as an explicit function of x and the range of variation of x 
consists of all real numbers. The implicit function from which it arises 
is expressible 

a;*+3a;-f/~2=0. 

The inverse function is more difficult to derive, but it is seen that, to each 
given value of y, there corresponds either a pair of values of x or no value 
of X at all. Solving the above quadratic equation for a given value of y^ 
we find (see 2.8 below) that 



'Skis gives x as an explicit function of y in which the range of values of y, 
for a real value of Xy must be limited so that negative values numerically 
greater than are excluded. 

Ex. 3. The variables x and y take any values whose product is 3. In 
symbols, the implicit function is 

and the twb explicit find inverse functions are 

3,3 
. y=- and a;=~. 

t ^ y 

Again, the range of a; or y consists of all re{».l numbera ‘ 
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Ex. 4. The variable y is defined as the result of the process 

y = 100 (1-05)* 

where the range of values of x is taken as consisting of positive integers 
only. The inverse function can be expressed only in terms of logarithms 
(see 10.2). This is an example of a function of a discontinuous variable 
and it can be noticed that it expresses the amount £i/ that results when 
£100 is left for x years at 5% interest compounded yearly. 

Ex. 5. The variables x and y are related by the equation 

an implicit function between x and y. The explicit functions are 
y—±Jl^-x^ and x=±J\^~y^, 

The range of values of x for a real value of y consists of all numbers not 
numerically greater than 4, and similarly for the range of y. Except 
when a; or 1 / is ±4, there are two values of y corresponding to each given 
value of X, and conversely. 

Ex. 6. An implicit function between x and y is defined by 

-\-y^ =0. 

The two explicit functions here cannot be expressed in any direct 
algebraic way. This does not mean that the functions do not exist. 
Given any value of a;, a “ cubic equation determines the corresponding 
values of y and we know (see 2.8 below) that there is either one value or 
three values of y, according to the value originally allotted to x. The 
dependence of y on x is definite and so is that of x on y. 


Ex. 7. Corresponding values of x and y are defined by the table : 


0<X:^3 

3< X 

4< X 

5< X <6 

6<x^7 

7<x<8 

... . .. ... 

8< X < 16 

y=6 

J/=7 

00 

II 

y=9 

3/ = 10 

y = ll 

t/ = 12 


Here, y is a function of x and the range of x consists of all positive nui^ers 
not greater than 15. On the other hand, when x is considered as a function 
of y, the range of the latter is discontinuous and consists only of the 
positive integers between 6 and 12. This function expresses the inland 
postage, y pence, as dependent on the weight of a parcel, x pounds (see 

P.O. Guide, 1936). A function of this nature is called a step function. 

% • • 

Ex. 8. The variable y is defined as the number of marks obtained by 
the candidate whose official number was x at the June Matriculation 
Examination of London University, 1936. Here, a table is necessary 
showing the nuiAbers of alkcandidates with the number of marks obtained 
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entered against each case. The range of x consists of those positive 
integers allotted as numbers to candidates who sat the examination, and 
the range of y consists of those positive integers corresponding to total 
marks obtained by at least one candidate. Both variables have discon- 
tinuous ranges. 

2.2 The graphs of functions. 

We have shown (1.8 above) how points in a plane can be located 
by means of numbers. Direct use can be made of this method in the 
graphing of functions. The practical process is described in text- 
books on algebra and we need give here only a general accoimt and 
a few examples of the process. 

From the definition of a given function, a table of corresponding 
values of the two variables x and y can be constructed and made as 
detailed as we wish. It is usually a matter of giving definite values 
to one of the variables and solving the formula of the function for 
the corresponding value or values of the other variable in each case 
If the range of variation is continuous, there is no limit to the 
number of entries that can be inserted. 

Co-ordinate axes Ox and Oy are ruled on a sheet of graph paper 
(with, e.g.. Ox horizontal and Oy vertical) and a definite scale 
of measurement is selected. Each pair of values from the table 
obtained from the function is plotted as a point on the graph paper. 
As more and more points are plotted, it is found, in the case of any 
ordinary function, that a freehand curve can be drawn through them. 
Either the collection of plotted points, or the freehand curve, is 
called a graph of the function. The following examples fully illus- 
trate the process. 

i!x. 1. y=a:*+3x-2. 

The following table of values of x and y is obtained from the formula 
of the function : 


X 

-4 

-3 

-2 

-4 

-1 

0 

1 

2 


2 

1 

-2 

i. 

-4 


-4 

-2 

2 

8 


Plotting the corresponding points, it is seen that a U-shaped curve can 
be drawn through them (Fig. 4). Between the points where x equals -2 
and -1, more detail is required than elsewhere and 6ne mtermediate 
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point has been plotted. It is found that the lowest point of the curve is 
at or very near to the point ( — I —IJ). The graph represents a curve 
known as the parabola (see 3.4 below). 


Ex. 2. xy = ^. 

The graph of the function can be plotted from the table : 


X 

-4 

-3 

-2 

-1 

0 

1 

2 

3 

4 

y 

-i 

-1 

3 

“ 2 

-3 

? 

3 

3 

2 

1 

i 


It is seen (Fig. 5) that a smooth curve can be drawn through the set of 
plotted points, except that there is some doubt about its course when v 
is small. When x is actually zero, the formula of the function provides 

1 



the values of y being taken to one decimal place. Outside the range 
Shown, there exist fio values of y B,t all. At a: =» ±4 there is a single value 
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of y while at other values of x there are two values of y. The graph of the 
function takes the form shown (Fig. 6) ; it is recognised as a circle of 
radius 4 units with centre at the origin of co-ordinates. 

Ex. 4. ~3a;t/ = 0. 

The table of corresponding values of x and y, from which a graph of the 
function can be drawn, now appears : 

-4-9 -3-9 -2-8 -1-7 0 0-7 1-3 1-6 1-9 2-9 3-9 

3-9 2-9 1-9 0-9 0 1-3 1-6 1*5 -2-8 -3-9 -4*9 

0-15 0*7 0-9 

-1*5 -2*25 -2*4 

Here, in order to obtain the values of y corresponding to any given 
value of X, it is necessary to solve a cubic equation. The values above 


Fia. 6. Fig. 7. 

are correct to one decimal place and are obtained by a graphical 
method described later (see 2.8 below). The set of points plotted from 
•^e table can be joined by a smooth curve (Fig. 7) consisting of a “ loop 
and two “ tails The graph represents a curve, known as the Folium 
of Descartes, of a more unusual type than those obtained in the 
previous examples. 

Ex. 6. The step function defined by the table of 2.1 (Ex. 7) can be 
plotted in a similar way. No smooth curve can be drawn through the 
plotted pdlnls which» can, in fact, be connected only by a set of seven 
disconnected lines parallel to the x-axis (Fig. 8). This is a function which 
will latcfr be described as discontinuous. 

A graph is clearly of great service in the cdhsideration of tile 
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properties of particular functions : it provides, at least, a visual repre- 
sentation of the related variation of the two variables. For example, 
the varying height of the curve shown on the graph (above or below 
the horizontal axis Ox) indicates the way in which y varies as a; is 
changed in a definite way. The graph can also be used to replace 
numerical calculations, provided 
that approximate results suffice. 

In plotting the graph, only certain 
convenient (e.g. integral) values 
of the variables are used and, 
once this is done, corresponding 
values of the variables at inter- 
mediate positions can be read off 
approximately from the curve 
shown. This method of grapliical 
“interpolation is used in the sol- 
ution of equations (see 2.8 below) and in many other connections.^ 

The method of graphing we have described illustrates the relation 
between two variables connected by a given mathematical function. 
But the same method can be used when the variables represent 
physical or other quantities given by observed phenomena and when 
we know that no “ perfect ” functional relation can exist between 
them, either in the nature of things or because some suspected rela- 
tion is disturbed by random deviations or “ errors Here we obtain 
what can be called a “ statistical rather than a functional, graph. 

Two cases can be profitably distinguished. One of the observed 
quantities x may be ordered in space or time and so that a single 
value of the other observed quantity y corresponds to each value 
of x. We take x along the horizontal and y along the vertical a*is 
and the graph shows a series of points proceeding from left to right, 
varying height illustrating the variation of the observed y as the 
observed x varies according to its ordered progression. The graph 

* For example, the graph of y = sjx can be plotted by giving x values 
which are perfect squares. The values of square roo^s such as i^/2 or ^3 can 
then be obtained approximately as the heights of the graph corresponding to 
(e.g.) a: =2 or 3. Alternatively, the graph of y=x* can be plotted, using 
integral values of a?, and the approximate values of (e.g.) s/2 and v*® can then 
be read off the points on the axis Ox corresponding to heights of 2 and 3 on 
• the graph. See Hegben, Mathematics for the Million (1936), p. 415. 
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may show, for example, the varying observed volume of a gas at 
constant temperature as the observed pressure increases over time. 
From this graph, we could see whether, apart from errors and 
random deviations, a definite fimctional relation between volume 
and pressure can be reasonably assumed. Again, a graph can be 
used to show the value of imports into the United Kingdom over a 
period of years, or the variation in some other statistical time 
series (see 10.4 below). In these cases, no functional relation (e.g. 
between imports and time) can exist and the graph is mainly 
descriptive. 

On the other hand, neither of the two observed quantities may 
possess a natural spatial or temporal order. We may have, for 
example, a series of observed values of the market price (p) of a 
commodity and the corresponding market consumption (g). Such 
a correspondence can be obtained by taking a succession of yearly 
quotations of p and q (see Examples 1, 30). Each pair of values of 
p and q can be plotted as a single point referred to two axes Op and 
Oq and the whole set of observations is then represented by a 
graphical ‘‘ cluster ” of points. Such a graph is called a “ scatter 
diagram and its uses, at least for purposes of illustration, are 
evident. It may be suspected, for instance, that there exists a 
definite relation between p and q, a relation which is disturbed by 
errors and other deviations in the actual observations. The correct- 
ness of this supposition can be examined and, if a relation is found 
to exist, its nature can be discovered. 

2.3 Functions and curves. 

The graphical method of representing functions can be extended 
tOjpstablish a general connection between functions and curves. If 
we are given a functional relation between two variables x and y of 
continuous variation, then there is no limit to the number of entries 
that can be inserted in the table of corresponding values of the 
variables and, hence, no limit to the number of points that can be 
plotted in the plane Oxy, The function gives rise, therefore, t<5 an 
indefinitelyl^rge nuMiber of points in the plane and only a few of 
them are shown in any actual graph. Just as the variables are 
related iH an ordered way by the function, so the corresponding 
points must display a definite characteqptic, i.e. must make up a« 
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locus or curve in the plane Oxy. Hence, to each given function relating 
variables x and y, there corresponds a set of points comprising a 
curve in the plane Oxy ; the analytical property defined by the 
function is reflected in the geometrical property common to all points 
on the curve. 

Conversely, a curve, as a geometrical concept, is a collection of 
points in a plane with a common characteristic. Fixing co-ordinate 
axes and attaching co-ordinates to each point of the plane, the 
property defining the curve can be translated into an analytical 
relation between x and y satisfied by 
all points (x, y) on the curve, a 
relation which is called the equation 
of the curve. Consider, for example, 
a circle with radius 4 units and 
centre at a fixed point 0 of a plane, 
i.e. the curve consisting of all points 
distant 4 units from 0. Fix co- 
ordinate axes to pass through 0^ let 
P with co-ordinates (x, y) be any 
point on the circle and drop per- 
pendiculars PM and PN to the axes 
(Fig. 9). From Pythagoras’ Theorem 

0P2 = OM^ -f JfP2 =. OM^ -h ON'^ = +t/*, 

since OM =x units and ON —y units. But OP must equal 4 units for 
all positions of P on the circle. Hence, 

is the relation satisfied by all points (x, y) on the given circle, i.e. the 
equation of the circle (cf. 2,2, Ex. 3, above). ^ 

The correspondence between the functions of analysis and the 
curves of geometry is thus a perfect one. To each function relat ing 
variables x and y there cor responds a definite curve i n the plane 
Oxy, and conversely . It is important to remember, however, that 
the uniqueness of the connection depends entirely on the fixing erf 
co-ordinate axes in the plane. A given curve hj^s differerrf equations 
when different axes are selected. 

It is now clear that a graph of a given function is simply an actual 
jpepresentation oi^ squared paper of the curve which corresponds to 
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the function. The function is an abstract concept relating two 
variables x and y ; the curve is an equally abstract concept relating 
points in two-dimensional space. If the variables are continuous, 
there is an indefinitely large number of pairs of variable values and 
of points on the curve. A graph, on the other hand, consists of a 
finite collection of points selected arbitrarily from the indefinitely 
numerous set possible and designed to indicate clearly the shape 
of the curve. But the curve itself remains an abstract concept, 
of which the graph is only a visual, and more or less imperfect, 
representation. 

Finally, notice that a curve, as defined above, need not be 
“smooth” in the sense that it can be shown graphically as a 
continuous line without breaks or angles. Nor need the curve be 
described in any temporal way by a moving point. It is often 
convenient to think of a point moving along a curve, but this is 
only a device to facilitate exposition. A curve, like a function, is 
defined and considered as a whole, as a collection of points in a plane 
united by a common characteristic. 

2.4 Classification of functions. 

We can now proceed to distinguish functions of different kinds 
according to the nature of their definitions or of their symbolic ex- 
pressions. It is necessary, for this purpose, to introduce a number 
of technical but very useful terms, 

A function is analytical if it can be expressed in symbols by moans 
of a single formula connecting the variables, a formula which is the 
“ general law ” behind the function. From the formula, as we have 
seen, values of one variable, corresponding to given values of the 
o^er, can be derived by carr3dng out the algebraic or other 
operations indicated. The first six examples of 2.1 give analytical 
functions ; the other two functions are not analytical. 

The next distinction is an extremely important one. Here, we 
regard functions as explicit, taking either y as a function of a; or a; as 
a fimction of y. Explicit functions can be divided into two classes, 
those which are singje-valued and those which are multi-valued. A 
function such that one and only one value of y corresponds to each 
given value of a; is said to define y as a single-valued function of x. 
A function for which this is not true defines y ^ a multi-valued, 
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\function of x, and we have double-valued, triple-valued and higher 
fvalued functions. The inverse function, a; as a function of y, can be 
, considered in the same way. 

The functions of the first four examples of 2.1, as well as those of 
the last two examples, all give y as a single-valued function of x. 
In the fifth example, t/ is a double-valued function, and in the sixth 
example i/ is a triple-valued function of x. On the other hand, only 
in the first, third and fourth examples is a; a single-valued function 
|bf y. This introduces a very important point ; the inverse of a single- 
valued function is not necessarily itself a single-valued function. This 
is clear from the inspection of even such a simple function as that of 
the second example of 2.1. 

In graphical terms, if i/ is a single-valued function of x, then the 
curve corresponding to the function is cut by any line parallel to Oy 
in only a single point. But, if i/ is multi-valued, then the curve can 
be cut by such lines in two or more points. In the same way, a; is a 
single-valued (or multi-valued) function of y if the curve is cut by 
lines parallel to Ox in only a single point (or in several points). There 
is no necessary connection between these two ways of finding the 
intersections of a line and the curve ; a given curve can be cut by a 
line parallel to one axis in only a single point while lines parallel to 
the other axis cut it in two or more points. The single- valued 
property does not necessarily apply to both the inverse explicit 
functions. 

One reason for the importance of single-valued functions is clear. 
When 1 / is a single-valued function of x^ we can take the function 
as described by values of x which increase steadily in value ; the 
independent variable can then be completely ignored and attention 
concentrated upon the variation in y. The value of y may be found 
to increase, to decrease, to have alternating increasing and decreasing 
“ stretches ’’ or to vary in a more or less erratic way. Graphically, 
we take x along the horizontal axis and describe the curve from left 
to right. The varying height of the curve above or below the hori- 
zontal shows the variation of y, and we can see when the curve risas 
(y increasing) and when it falls (y decreasing).^ • ^ 

The last remarks lead to the definition of a particularly useful 
sub-class of single-valued functions. If y is a single-valued function 
^ of a continuous variable x, and if y increases in value as x increases, 
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' then y is called an increasing function of x. Similarly, if the value 
of y decrease^as x increases, we have a decreasing function of x. The 
class of increasing and decreasing functions, taken together, com- 
prise what are called monotonic functions. The graph of a monotonic 
function of Xy the axis Ox being taken as horizontal, either rises or 
falls without interruption from left to right, rising for an increasing 
function and falling for a decreasing function. 

Subject to minor qualifications concerning the ‘‘ continuity ” of the 
function (4.6 below), a significant property of a monotonic function 
can be established. The inverse of a single-valued and monotonic 
function is also single-valued and monotonic in the same sense as the 
original function. For example, if y is a single-valued increasing 
function of Xy it follows that only one value of x can correspond to 
each value of y and that this value must increase as y increases. 
This is quite clear graphically, since the curve representing j/ as a 
single-valued increasing function of x (axis Ox horizontal) rises 
throughout from left to right and so the distance of the curve from 
the vertical axis increases as we proceed upwards. It is thus only 
in the case of monotonic functions that we can always say that both 
inverse functions are single-valued. 

The first and third examples of 2.1 show functions which are 
monotonic, the first being an increasing function and the other a 
decreasing function. In each case, what is true of y as a function 
of a; is also true of a: as a function of y. The second example of 2.1 
provides a case of a single- valued function which is not monotonic. 
As X increases from large negative values up to a; = ~ |, the value of 
y decreases, whereas y increases as a; is increased beyond a;= ~f. 
The graph falls and then rises. The inverse function, a; as a fimction 
of^, is not single-valued, but double-valued, here. 

Returning to the case of multi-valued functions, it is sometimes 
possible to divide such a function into two or more “ branches 
each branch being a single- valued function. The corresponding 
curve is also divided into two or more distinct sections, each cut by 
li^es parallel to the appropriate axis in only a single point. The 
function of^he fifth example of 2.1 is a case in point. The explicit 
expression, giving y as a double-valued function of Xy is 


y= 
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which can be divided into two single- valued functions, 
y=4.>/l6-a;2 and 

The corresponding curve, a circle, is likewise divided into two single- 
valued sections, one above the axis Ox and the other below. In the 
same way, the second example of 2.1 gives a; as a double-valued 
function of y divisible into the two branches 

a; = J(N/4y -1-17 “3) and - |(V4?/ +17 + 3). 

The corresponding curve, a parabola, is divided into two sections by 
a line parallel to Oy at a; = - f , each section being cut in only a single 
point by lines parallel to Ox (see Fig. 4). The first branch of this 
function is an increasing function of y, the corresponding section of 
the curve rising from left to right ; the second branch is a decreasing 
function of y and this section of the curve falls from left to right. 
Such divisions of multi-valued functions into single- valued branches 
are clearly very useful. 

Finally, it can be noticed that some functions can be described as 
aymmetricah Symmetry can be defined in various ways, of which 
the following is perhaps the most important. An implicit function 
relating variables x and y is said to be symmetrical in x and j/ if an 
interchange of these variables leaves the function unaltered in form. 
This implies that y as an explicit function of a; is of exactly the same 
form as a; as an explicit function of y. The function a:y = 3 is sym- 

3 3 

metrical #n this definititn and the inverse functions, y =- and a:=- , 

^ y 

are seen tf be of identical form. Graphically, such a symmetrical 
function is represented by a curve symmetrical about lines bisecting 
the angles between the axes. The part of the curve on one side of the 
line is the reflection in the lino of the part of the curve on the othej;^ 
side. The graph of a;y = 3 illustrates this fact (see Fig. 6). 


^.5 Function types. 

So far we have considered only a number of particular functions 
such as appear in the simplest operations of algebra. Our next taslr* 
is to group functions into types and to devise a symb(51ifr notation 
to include aU functions of one type in a single formula. The grouping 
of functions into types proceeds by means of the notion of aifalytical 
“ form This notion, which must be “ sensed rather than defined, 
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is essential to the extension of the processes of algebra into the more 
powerful methods of analysis, and its nature is indicated most easily 
by giving concrete examples. 

The explicit function j/ = 2a: is clearly but one example of a wide 
range of functions of the same form Other examples are 

y=^Zx\ f/ = 6a;4-2; j/=a;-3 and y = \-2x. 

All such functions can be included in the single formula 

y=ax -f 6, 

where a and 6 denote any definite numbers, positive, negative or zero 
The function type, represented by this “ portmanteau ” formula, 
is described as an explicit linear function of x* In the same way, 
we can define a; as a linear function of y. Further, putting the linear 
function into implicit form, we can write 

ax -|-c = 0, 

where a, b and c stand for any three numbers whatever. The implicit 
linear function gives rise to two explicit linear functions : 

the coefficients of which are expressed in terms of the three coeffi- 
cients of the original implicit function.f 
As a second example, we see that the explicit function 

y=x^-{-3x-2 

is one example of the function type, called the explicit quadratic 
function, which can be symbolised 

y=zax^ +bx -fc, 

where a, b and c denote any definite numbers. In the same way, we 
can have a: as an explicit quadratic function of y. Putting either of 
these functions into implicit form, it appears as a “ polynomial ” 
expression, containing no powers or products of the variables of 

♦ The term “ linear is used since, as we see later, such a function is^repre- 
sented grajirhfcally hyta, straight line. 

t The cases where a orb equals zero are a little troublesome (see 3.3 below) 
but need not delay us here. Further, it may seem odd that there are three 
coefficients in the implicit form and only two in the explicit form. This is, 
however, only a matter of convenience (see 3^.7 below). ^ , 
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higher degree than the second, equated to zero. Other implicit 
functions are also of this form. The function type xy~a, of which 
xy^3 is one case, and the function type ax^ +by^ = c, of which 
[q instance, are examples. We derive, therefore, a 
wider function type, symbolised by the formula 

ax^ -\-2hxy -hby^ +2gx +2fy +0 = 0 , 

; Which can be called the implicit quadratic function relating the 
variables x and y. This includes, for certain values of the coeffi- 
cients a, 6, c, /, g and h, the explicit quadratic function and also 
many others. The grouping of functions into types is not necessarily 
unique ; there are several overlapping types possible according to 
the particular point of view adopted. 

The number of different function types that can be distinguished 
is indefinitely large. Amongst algebraic ’’ functions, we can go 
on to separate cubic ” functions of explicit and implicit form, 
quartic ” functions, and so on. There are also numerous functions 
of forms we have not yet introduced and some of these types will 
be considered at a later stage. The exponential, logarithmic and 
trigonometric functions are notable instances. 

The grouping of functions into types, largely a matter of con- 
venience, is characterised by the use of symbolic letters other than 
those standing for the variables themselves. The familiar algebraic 
device of representing unspecified numbers by letters has been signi- 
ficantly extended and, before proceeding, it is essential to get some 
idea of this extension. The symbolic letters, such as a, 6 and c in the 
above formulae, must be “ variable ’’ since they stand for unspeci- 
fied numbers, but there must be some difference between them and 
the original variables x and y. This difference, on which the who^^ 
grouping of functions into types depends, must be made clear. 

To fix ideas, consider the quadratic function type which we can 
write j/=aa;2 +6a: -f-c. Two points of view can be adopted. From 
the first point of view, we investigate the relation between the vari- 
ables X and y and the numbers a, b and c must be taken as fixed, axi 
having the same numerical values whatever they may^e. From 
the other point of view, the function is treated as a whole and the 
type includes a whole set of particular functions united by the 
^common property of bein^ “ quadratic The formula, in fact, 
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represents any function of quadratic form. Here, x and y cease to 
be of importance and the whole function is made to vary by changing 
the values of a, 6 and c. These latter numbers are fixed within one 
function of the quadratic type but variable from one quadratic 
function to another. 

Hence, the characteristic of letters such as a, b and c is that they 
lead double lives ; they are “variable constants To suggest this 
double use, we describe such symbolic letters as parametric constamU 
and one of two shorter terms can then be used according to the point 
of view. When attention is directed to the relation between the 
variables of the function, the symbols denote fixed numbers and can 
be termed constants. When the important thing is the variation of 
the function as a whole, the variables of the function fading out of 
the picture, we describe the symbols as parameters,* 

We have tried to show how the functional notation is made 
more general and flexible by the use of parametric constants. The 
quadratic function type, for example, becomes a particular function 
once numerical values are allotted to the parameters. But, taking a 
more general view, we can attempt to derive properties, not of one 
quadratic function, but of all quadratic functions. Or we may know 
that some quadratic function satisfies a given condition and then 
proceed to determine the values of the parameters for which the 
condition holds. This, for example, is the idea behind the fitting of 
a “ parabolic trend ’’ to a statistical time-series. 

A simple algebraic instance can be quoted. It is required to find 
the height of an open box of square base of side 4 inches so that the 
surface area is 48 square inches. If the height is x inches, the surface 
area {y square inches) is a function of x : 

** y = 16(a:+l). 

Putting y = 48, the value of x is found to be 2 inches. This problem 
can be generalised and solved for all cases by the use of parameters. 
If y square inches is the surface area of a box of square base of 
given side a inches and of variable height x inches, then • 

"• • • 1 

•y=a*-f4a« and x——{y-a^). 

*xa 

/ * The term parameter comes from TrapoLfierpecj which means “ to compare ” 

or “ to measure one thing by another • • 
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This is a functional relation between x and y. If the surface area is 
known to be 6 square inches, then 

b -a^ . 1 
« = — — inches. 

4a 

We have solved all problems of this type, and the solution of any 
particular problem is derived by allotting the parameters a and b 
the appropriate numerical values. 

Finally, the use of parameters is carried over into the mathe- 
matical sciences, as is seen, for example, by the parametric role of 
market prices in economic theory. A firm (under conditions of “ pure 
competition ”) must take as given the market prices that happen to 
exist and we can assume its output is known for each given price 
system. In combining the output decisions of all firms, we treat the 
price system as variable and attempt to discover a set of prices con- 
sistent with equilibrium (however the latter is defined). Prices are 
parametric constants, constant in the examination of the decision 
of a single firm, parameters when we combine the separate decisions 
of a whole group of firms. 

2,6 The symbolic representation of fimctions of any form, 

A further generalisation of the functional notation now takes us 
outside the limited field of algebra, to which we have been confined so 
far, into the wider territories of modern analysis. The idea involved 
is simple. Just as a symbol is introduced in algebra to stand for 
any unspecified number, so now a symbol is introduced to stand 
for any function. In algebra, we deal with the properties of functions 
of particular types ; with the new notation, mathematical analysis 
goes far beyond this and introduces more powerful processes, such 
as those of the calculus, to deal with functions of any kind whatever. 
The development of mathematics, here as elsewhere, is dependent on 
the development of its notation. 

I An implicit function relating two variables x and y can be repre- 
sented, whatever its form may be, by the notation 

f(x,y)=0. ^ 

This is certainly appropriate for any analytical function, the sym- 
bolic /(x, y) standing for some expression involving x and y. To 
dbtaiQ) from the general notation, a particular function, we need 
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only specify the exact form off{x, y) ; for example, if we take/(a;, y) 
as 2a: a;* - 16 or - 3a:y, we obtain a particular function 

y = 2a:, a:*+t/* = 16 or a:* 4 * 2 /^ - 3a:t/ = 0. The notation can also be 
applied, without confusion, to include even non-analytical functions. 

Again, if y is an explicit function of x of any form whatever, we 
can denote it by the notation 

y=fM- 

This notation is clearly applicable to the case of a function which is 
single- valued and analytical.* In such a function, y is defined as 
equal to the value of a certain expression involving x, and /(x) can 
be taken as a convenient way of representing this expression in x, 
no matter what its form may be. To derive a particular function, 
we supply a definite form for /(x) ; if we take (e.g.) f(x) as 2x, 
3 

a;* -f 3a: - 2 or - , we obtain three well-known single- valued functions. 

X 

The notation can also be extended to apply to functions which are 
not single- valued or even analytical; in its broadest use, y=f{x) 
signifies only that we are taking y as some function of a:. Notice that, 
in this notation, we need not refer to the variable y at all ; we can 
say that f{x)iaa, function of x. For example, we often describe 
(x^ 4- 3a; - 2) as a function of x. This is a matter of convenience only ; 
a function always relates the values of two variables whether we 
care to suppress reference to one of them or not. 

Since the importance of a good notation, both general and flexible, 
is evident, we can insert here a number of remarks on the symbolism 
of mathematical analysis. Letters are used to denote both variables 
and parameters, and, in order to distinguish one from the other, it 
IB usual to reserve the later letters of the alphabet for variables and 
the earlier and middle letters for parametric constants. The letters 
x, y, 2, Uy V and t are most frequently used for variables, and, if these 
do not suffice, the Greek letters rj and ^ are called into service. 
When parametric constants are regarded primarily as constants, 

♦ An alternative, and in some ways a superior, notation is occasionally used. 
Instead TSf writing y —f{x) to denote the single-valued function, we write 
y^y(x). This economises letters and enables us to write the dependent 
variable by the single letter y when the dependence on a; is not stressed, and 
by the seime letter with x in brackets when the dependence on x becomes 
important. For an actual use of the notation in economics, see Frisch, 
'New Methods of Measuring Marginal Utility f (1932). * 
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the letters a, 6, c, , or the corresponding Greek a, j8, y, ... , are 
conventionally used. When the parametric property is the more 
important, it is often useful to take Z, m, n, p, q and r, 
or /c, A, ft, V and p, to denote the parameters. These remarks 
serve only as general guides ; the context should be sufficient, in 
any particular case, to make clear the nature of the symbols used. 

The letters denoting functions are of different nature from the 
variable or parametric letters ; they do not denote anything to 
which numerical values can be allotted but stand for the complex 
notion of the form of a function. It is usual to reserve the letters 
/ and g, and the Greek <f> and tp, together with the corresponding 
capitals, for the denotation of functions. It is also possible, for 
greater variety, to add suffixes to the functional letter, /j, /,, ... 

and </»i, (f) 2 i ••• being examples. 

It is, of course, essential to denote different variables, parameters 
or functions by different letters. For example, if we wish to say that 
any implicit function gives rise to two explicit and inverse functions, 
we must symbolise somewhat as follows. From the given implicit 
function / {x, y) = 0, we derive the two ftmctions y = <f){x) and x^\p(y) 
which are inverse to each other. 

Functions refer to operations performed on variables, and it is 
thus necessary to denote a function, not only with its operational 
or functional letter, but also with an indication of the variable or 
variables to which the operation applies. The notation f{x) for an 
explicit function makes this clear, the letter / being the functional 
letter and the letter x in brackets denoting the variable to which 
the operation applies. Other operations, such as those of derivation 
and integration, will appear later and will be denoted by further 
sjmabolic devices of the same kind. 

One more notational device remains for consideration, i.e. the 
method of introducing particular values of the variables into a 
general function. Suppose y =/(-; is some explicit function in which 
we give, to the variable x, definite numerical values such as 
x = 0,^x = ^, a; = l. The corresponding values of y are then written 
/(O), /(J), /(I). Further, we can allot values t6 x ^hich are fixed 
but not specified. Such values of x can be denoted by a, 6, c, , or 
by the letter x with suffixes or primes added : 

x^y -v or x\ x’\ .... 
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The value of y when x is given the fixed value (e.g.) a or is written 
f(a) OTf{xj), These fixed, but unspecified, values of the independent 
variable are, of course, instances of parametric constants. To illus- 
trate with a particular function, if 

f(x) - 2 , 

then /(O) = (0)2 +3 (0) - 2 = - 2 ; /(|) = (J)2 + 3 (J) ~ 2 = ~ i, 

/(1)=:(1)2 +3(1) -2=1:2 ; and so on. 

Further, f{a) = + 3a - 2. 

Again, i{f{x, j/)=0 is an implicit function, then /(a, f/)=0 indicates 
an equation to be solved for the values of y corresponding to x = a, 
and/(x, 6)=0 an equation giving the values of x corresponding to 
y=b. Specific numbers can, of course, be substituted for a or 6. 

As an example in the use of the functional notation, consider the 
following operations of wliich much use will be made later. If a 
and (a +A) are two particular values of a variable a;, where a and h 
are any positive or negative values, then /(a) and /(a+A) are the 
corresponding values of a function /(a:). The change (or increment) 
in the value of the function corresponding to the given change in a: is 

/(a +A) -f{a). 

The change in the function per unit change in x is then 

/(g +fe) -/(g) 

h 

and this is defined as the average rate of change of the function when 
X changes from a to (a +A). The actual changes shown in x and 
f(x) can be positive or negative according as they are increases or 
decreases. 

0.7. The diagrammatic method. 

We have seen that a definite curve referred to selected axes in a 
plane corresponds to each particular function wo care to define, and 
conversely. Further, when we are dealing with definite functions 
and curves, the graphical method^ in which the graph of a function 
or curve is^ plotted on squared paper, is of great service. But it 
is easily seen that‘s the connection between functions and curves 
remains of the greatest importance when we pass from particular 
functions to function types. It is reasonable to suppose, and it is in 
fact true, that the curves corresponding to a set of functions of the 
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same type form a class of the same general nature. To a function 
type there corresponds a curve class, and properties common to 
all functions of the type are paralleled by properties common to all 
curves of the class. This correspondence is investigated in the branch 
of mathematics known as analytical geometry ^ where we consider 
what function types correspond to particular curve classes and what 
analytical properties of the functions reflect various geometrical 
properties of the curves. Some account of analytical geometry will 
be given in the following chapter. 

The function-curve correspondence would appear to be useless, 
however, when we come to “ arbitrary ” functions of unspecified 
form. All that can correspond to an arbitrary function is an arbi- 
trary curve, and neither graphical methods nor analytical geometry 
avail. But it always happens, in our analysis, that functions, 
though otherwise arbitrary, are limited by certain general properties. 
It may be, for example, that a function is limited by the conditions 
that it is single-valued, positive and decreasiag, or that it decreases 
up to a certain point and then increases. The curve which corre- 
sponds to the function, though still largely arbitrary in position, 
now displays sufficient properties to make the drawing of an illus- 
trative diagram worth while. In the first of the above instances, 
the diagram would show a curve lying completely above the hori- 
zontal axis and falling steadily from loft to right. 

It is here that the diagrammatic method of illustrating analytical 
arguments becomes important. The least service diagrams can 
perform is to illustrate and check our analytical development. It is 
possible, in some cases where the properties of our functions are very 
definite, to make more positive use of diagrams by employing them 
in conjunction with simple geometrical arguments. But we must be 
careful to avoid mistaking casual properties of the curves in our 
diagrams for essential general ones, and to prevent ourselves accept- 
ing apparently self-evident results without formal proof. 

It is to be noticed that one curve serves to represent a given 
fimctipn, whether the latter is expressed in implicit form or in either 
of the inverse explicit forms. We need examine^ the shaptr'of only 
one curve to illustrate properties of a functional relation between 
X and y, of t/ as a function of x and of a; as a function of y. To see, 
f(jr example, how njany values of y correspond to a given value of a:, 
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and conversely, we look for the number of points of intersection of 
the curve and lines parallel to the axes. The use of the diagram- 
matic method on these lines is illustrated in 2.4 above. 

A minor difficulty in the use of diagrams concerns the fact that 
we can fix the co-ordinate axes in two different ways, even when we 
adopt the established convention of drawing one axis horizontal. 
A given curve may look quite different when one variable is measured 
along the horizontal axis than it does when the other variable is so 
measured. Since it is important to be able to recognise the same 
curve however the diagram is drawn, we can indicate here a way in 
which a curve shifts when the axes are transposed. Fig. 10 shows 



the curve corresponding to a given function relating x and y, firstly 
when Ox is horizontal and secondly when Oy is horizontal. The two 
positions of the curve are such that one position is the “ reflection 
of the other in the bisector of the angle between the axes, i.e. in the 
broken line shown in the figure. Hence, if we start with the curve 
in the first position and rotate the plane containing it through 180° 
about the bisector indicated, the new position taken up by the curve 
^ the position with axes interchanged. 

2.8 The solution of equations in one variable. 

As an exercise in the use of analytical methods supported by 
graphs or diagrams, we can consider here the important question of 
the solution of equations, to which many mathematical prioblems 
reduce.^ *Ah equation in a single variable x can be written, in general, 
B»f{x) =0, where f{x) denotes some given expression or function of x. 
Our problem, in solving the equation, is to find the value or values 
of X which make the value of f(x) equal to zero.., ^ 
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The simplest type of equation in one variable occurs when f{x) is 
a polynomial, i.e. the algebraic sum of terms which involve only 
various powers of x, and such equations can be classed according to 
their degree, as determined by the highest power of x involved. 
There are linear equations (of the first degree), quadratic equations 
(of the second degree), cubic equations and so on. A solution of such 
an equation, a value of x which makes the polynomial zero, is 
iisually called a root of the equation. The process of finding roots 
of polynomial equations, at least in simple cases, is fully described 
in text-books on algebra. 

The general linear equation is ax-\-h — 0 and this has one root, 

X = - - , in all cases. The general quadratic equation is 
a 

ax^ +bx -f c = 0, 

of which the solution is known to be 

-b ±slb^-4:ac 



It follows that there are two real roots if 6^>4uc, and no real root 
if b^<4ac. If b^ = 4ac, then the two real roots coincide and there is 
only one value of x satisfying the equation, which reduces, in this 
case, to a perfect square equated to zero. The formula provides the 
roots indicated in the following examples of quadratic equations : 
2x2 -X- 1=0, a:=~| and x = l, 

x2-3x 4-1=0, x = 1(3+75) and x = 
x2 +x + 1 = 0, no real roots. 

In the first example, the roots can also be found by a simple process 
of factorising the quadratic expression. 

Cubic and higher degree equations present much more difficulty. 
It is occasionally possible to solve such an equation by factorisation, 
as in the case of the cubic equation 

x2 - 2x2 _ 2a; +1 =0. 

By trial and error we see that x = - 1 satisfies the equation. So one 
root is^-1 and (a;+l) must be a factor of the cubic expression. 
Taldng out this factor, the equation becomes ^ • *• 

(x +I)(x2~3x +1)=0. 

The other roots of the cubic are thus those of the quadratic 

x2-3x +1=0. 
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The complete solution of the cubic then gives three roots, 
x=-l, a; = |(3 4-V5) and a: = J(3-^5). 

In general, however, analytical methods of solving cubic and 
higher degree equations are difficult to devise and will not be con- 
sidered here. Instead we can indicate a graphical method of solving, 
not only a polynomial, but also any given equation. By its nature, 
the method can only produce approximate results, the approxi- 
mation being closer the more accurately the graphs are drawn. The 
following examples suffice to illustrate the method. 

Ex. 1. ic* - 2a;* - 2a; + 1 =0. 

The graph of the function y = a;* - 2a;* - 2a; + 1 is constructed as shown 
in Fig. 11, where (for convenience) different scales for x and y are adopted. 




The roots of the equation are those values of x which make y zero, and so 
‘must be obtained from the points where the curve of the graph cuts Ox, In 
this case, it is seen that the plotted curve cuts Ox in three points and the 
approximate values of the three roots are read off as 
a;«-l, a;=0*4 and a; = 2*6. 

These are the values, to one decimal place, of the exact roots already 
found. 

Ex. 2L -3a; -3=0. 

Fig. 12 shows the graph of the function y-x^ -3x -3, and the curve 
plotted is seen to cut Ox in one point only where the value of x is a little 
greater than 2. The cubic equation has thus only one root which is 
approximately equal to 2.1. , t • 
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Ex. 3. A graphicjal method slightly different from that of the previous 
examples can also be used. Taking the cubic equation of the last example, 
we can write it in the form 

a:* = 3a;+3. 

Fig. 13 shows the graphs of the two functions y and y = 3a: + 3, plotted 
on the same graph paper and using the same scales. The first function 
gives a well-known curve and the second a straight line. At any point 
where the curves intersect, the 
values of y are equal, i.e. the corre- 
sponding value of X satisfies the 
given cubic equation. Fig. 13 
shows that there is only one point 
of intersection and that the only 
root of the equation is approxi- 
mately 2*1 (as before). 

Hence, to solve any given 
equation f(x)=0 approximately, 
we need only plot the graph of 
y^f{x) and determine where it 
cuts the axis Ox. Or, we can 
split the equation so that it appears in the form <f>{x) = t(f{x), and 
then find the a:-co-ordinates of the points where the graphs of 
y = <l>{x) and y = ilj(x) intersect. In either of these ways two things 
are determined, the number of the roots of the equation and the 
approximate value of each root.* 

Notice that the first piece of information is often of use quite apart 
from the second. If this is all we require, the graphical method can 
be generalised into a diagrammatic method of determining the 
number of roots possessed by an equation of given type. For 
example, any cubic equation can be written in the form 

x^=ax^ -\-bx -fc, 

where a, 5 and c are constants. The curve y =a:* has been plotted in 
Fig. 13. The quadratic function y= ax* +6x +c (a 7*^0) is represented 
by a UT-shaped curve (known as the parabola, see 3.4 below)^ which 

* Various numerical methods of improving upon the approximate roots 
obtained graphically have been devised. The best known is that of Homer ; 
see Burnside and Panton, The Theory of Equations, Vol. I (8th Ed., 1918), 
pp« 225 et seq. 
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cuts the first curve either in one point or in three points.* The cubic 
equation, therefore, has sometimes one real root and sometimes three 
real roots according to the values of the coefficients. 


2.9 Simultaneous equations in two variables. 

From a single equation in one variable, we pass now to the next 
simplest case where two equations are given connecting two variables 
X and y. In general, the equations can be written 
A y)=0 and /a (a:, y) = 0, 

where /j and /g denote two given functional expressions. Such simul- 
taneous equations are said to be solved when we have determined 
a pair, or a number of pairs, of values of x and y satisfying both 
equations. The simplest analytical device of solving simultaneous 
equations is to obtain, from one equation, an expression for one 
variable y in terms of the other variable x, and to substitute this 
expression into the second equation. The result is an equation in 
X only, which can be solved in the ways already indicated. The 
following examples illustrate the method. 


Ex. 1. a; + f/-3=0 and a;~3y + l=0. 

From the first equation, we obtain y — Z-Xy and, on substituting in 
the second equation, we find that a; =2. The corresponding value of 
y = 3~a; = l. The simultaneous equations have the unique solution 
x-2 and y — l. 

The result can be generalised. The general pair of simultaneous linear 
equations 

aia; + 6 i 2 /+Ci =0 and 02 ^+ 622 / 4 - 03=0 


has a unique solution 


6 jC 2 — ^2^1 


and 


^ ** 1 / == 

(l -^2 ~ ^163 “ ^^2^1 

Ex. 2. 2a; + 2 / -1=0 and a;^ + 2 /^ ~ 3 a; 2 / = 0. 

The first equation is linear and gives y = 1 - 2a;. Substituting into the 
second equation and simplifying, we obtain 


C^(t2 ~ ^ 2^1 


7a;* -18** + 9a: -1=0. 

This cubic equation is found, by graphical methods, to have three roots 
which i^re approximately equal to * 

• *^a;=0*2, a; =0*45 and a; = 1*8. 


• A similar result holds when a—0. Hero the second curve is a straight line 
representing y=6a?+c, which again outs the first curve in one or in three 
points. (Cf. Fig. 13.) , • 
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The corresponding values of from ^ = 1 - 2a;, are 

y = 0-6, y=0*l and y= -2*6. 

There are, therefore, three solutions of the given simultaneous equations. 

A direct graphical method can be used to give the approximate 
solution of any specified pair of simultaneous equations, fi(x, y)=0 
and/g {Xy y) = 0 . We plot the graph of each of these implicit functions, 
using the same pair of axes Ox and Oy and the same scales in each 
case. At any point of intersection of the curves of the graph, the 
co-ordinates give values of x and y satisfjdng both the equations, 
i.e. give a solution of the simultaneous 
equations. The graph thus tells us 
how many solutions there are and 
gives us approximate values of x and y 
for each solution. 

To illustrate the graphical method, the 
equations of Ex. 2 above can be used. 

Fig. 14 shows the graphs of the implicit 
functions 

2x-¥y -I =0 and a;* -f-y* -3a;y =0. 

The first curve is a straight line and the 
second a looped curve which has been 
plotted already (in 2.2 above). There 
are three points of intersection, A. A 
and ^3. which provide, by reading off 
the co-ordinates, the three pairs of values 
of X and y given above. 

If the simultaneous equations are not given in specified form, then 
the graphical method breaks down. But a diagram can still be ^ 
drawn, in many cases, to indicate the number of solutions, but not 
their numerical values. For example, the solution of any linear 
equation aa; 4-6^ -|-c=0 and the equation a;*+i/*-3a:t/ = 0 can be 
indicated by the points of intersection of some straight line (repre- 
senting ax -f6y -l-c = 0, as shown in 3.3 below) and the looped curve 
of Fig. 14. There is either a single solution, or thAe solutions, of the 
simultaneous equations according to the values of a, b and c and the 
position of the straight Kne. 

It appears that, in general, jiwo equations in two variables x and y 



Fia. 14. 
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provide a “determinate” solution, i.e. there is only a definite 
number of pairs of values of x and y satisfying the equations. This 
is a question which will be taken up at a later stage (see 11.6 below). 
We can, however, consider here one objection to the method of solv- 
ing equations as applied (e.g.) in economic theory. It is said that 
a “ circular ” argument is involved when we write two equations in 
two variables x and y ; the first equation gives y as a function of x, 
the second a; as a function of y and these are inconsistent. This is an 
argument that has no justification. 

If no relation connects the variables x and y, then each can take 
any value independently of the other and, if they have continuous 
ranges, we have a “ doubly infinite ” set of values of the pair (a?, y). 
If one relation is known to exist between x and y, then the number 
of possible pairs (a;, y) is much restricted. There is still an indefin- 
itely large number of such pairs but the range is only “ singly 
infinite ”. Finally, if another relation is known and added to the 
first, the possibilities are still further restricted and, except in odd 
cases, there is only one or a finite number of possible pairs (a:, y). 
In this case, x and y are said to be determined. In diagrammatic 
er ms, a pair of values of x and y is represented by a point P in a 
plane. If no relation between x and y is given, P can move at will 
over the plane. If one relation exists between x and y, then P can 
move only in a restricted way, i.e. along the curve representing the 
relation. If two relations between x and y are given, the possible 
positions of P are confined to the points of intersection of two curves 
and they are, in general, only of finite number. 

To take a concrete “ applied ” example : if the output x of an 
industry is known to be related to the price p of the commodity by 
a demand relation (giving the amoimts the market would take at 
various prices), then the possible values of x and p are limited but 
still indefinitely large in number. But, if we also know that the out- 
put is related to the price by a supply relation (giving the amoimts 
the industry would offer at various prices), the values of x and p are 
much more limited and there is, in general, only one or a fei 7 pairs 
of values posi^ible.^ Here we say that output and price are deter- 
mined. The two relations of demand and supply are independent 
of eack other, and neither alone determines both output and price. 
There is no circular argument , , 
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EXAMPLES n 


Functiona and graphs 


1. A railway ticket is in the form of a rectangle ABGD with sides AB — 2 
inches and BG = 3 inches. The comer D is clipped off by cutting along the 
line EF, where E lies on GD and F on AD so that GE = ^AF, Denoting GE 
by X inches, find the area of the clipped ticket as a function of x fuid represent 
it graphicaUy. 

2. A variable point P is taken on a semicircle drawn on the diameter 

AB = 4 inches. PN is perpendicular to AB and N is x inches along AB from A . 
Find the length of PN in terms of x. Deduce the area of the triangle APN as 
a function of x, and plot a graph of the function for 0 <4. 

3. A beam has a rectangular cross-section of sides x and (x + 2) feet and 
its length is y feet. Find ^ as a fimction of a? if the volume of the beam is 
100 cubic feet, and represent graphically. 

4. The appended table shows a simple way of obtain- 
ing values of the quadratic function y = 2a;* - 2a? + 1. 

Extend the table to give the values of the function for 
integral and half-integral values of a? from -3 to +3, 
and plot a graph of the function. Find an explicit ex- 
pression for a? as a function of y and deduce that two 
values of x correspond to each value of y > J. Examine 
the cases where t/ < J. 

6. Plot a graph of the fimction y =5a: - 2a?*. Show that y has a greatest 
but no least value. Locate the greatest value as accurately as you can from 
the graph. Between what values of a? is t/ positive? 


X 

1 

2 

2x* 

2 

8 

-2x 

-2 

-4 

+ 1 

1 

1 

y 

1 

5 


6. Graph y = 2a? - 1 H — for positive values of a?, and show how the graph 

a? j 

can be obtained by the addition of those of y = 2a? - 1 and y =- . Establish 
that y is always positive. ® 

7. Plot a graph of y=-— -- for positive values of x, and show how the 

graph for negative values of a? can be deduced. What are the greatest and 
least values of y? Explain what happens to the value of y as x is increased 
iiidefinitely. 


8. A single man earns £x a year. His taxable income, obtained by 
deducting one-fifth of the total income plus his personal allowance of £100, 
is taxed at 4s. 6d. in the £, except that the first £175 is taxed at only half the 
full rate. K the tax is equivalent to y shillings in the £ of original income, 
show graphically the variation of y for incomes between £100 and £1000. 

New regulations are introduced whereby the first £135 of taxable income 
is taxed at one-third of the standard rate and the rest at the full rate. Plot a 
new graph (on similar lines) emd compare the new values of y with the old 
values. 


• 2x + l 

9. Find the function inverse to y =-^ — and show tl^t it is sing! J*- valued. 

Represent graphically, and give some account of the behaviour of the graph 
in the neighbourhood of x = 1 and of y = 2. • 


10. A chord of a circle of radius 3 inches has length x inches and is distant 
y inches from the centre. Find c^relation between x and y. Express y as an 
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explicit function of x and a; as an explicit function of y. Are these functions 
single-valued or not ? 

11. Obtain an explicit expression for the function inverse to + - and 

show that it is not single-valued. ^ 

12. By selecting a sufficiently large number of values of x and finding the 
corresponding values of y, indicate that the function T/ = 3a;* + 3a;* -l-a; - 1 is 
monotonic. Does y increase or decrease as x increases ? Illustrate graphically. 


13. Show that a;*+6a5+c can be written in the form (a; + J6)* -1(6* -4c), 
and deduce that y =a:* +6a; +c has a least value when x = - but no greatest 
value. Hence indicate the general shape of the curve i/ =«• +6a; +c. What 
is the condition that the curve lies entirely above the axis of a:? Consider the 
curve y= -a;*-f-6a;+cina similar way. 

14. Combine the results of the previous example to indicate the nature of 
the curves represented by the general quadratic equation y =aa;* +6a; +c. 
Show that the curves are of the same t5q)e and can be divided into two groups, 
one group consisting of curves with a lowest but no highest point, and the 
other group of curves with a highest but no lowest point. 

15. A ball, thrown vertically into the air with a given velocity v, reaches 
a height x after time t. It is known that x is the difference between two 
independent factors. The first is the effect of the initial velocity v and is 
measured by v times t. The second is the effect of gravity and is represented 
by a constant \g times <*. Express a: as a function of t involving the two 
parameters v and g. Taking g = 32, find the height of the ball after 4 seconds 
if the initial velocity is 76 feet per second. 

16. The volume of a rectangular box of sides a?, (a;-f 1) and (a? + 2) inches 
is y cubic inches. Of what type is y as a function of a; T Generalise to give the 
volume when the smallest side is x inches and the other sides are respectively 
a inches and 6 inches longer. Arrange the function to show its type and its 
dependence on the parameters a and 6. 

17. Write down the general symbolic form for t/ as a cubic polynomial 
function of x, and show that functions such as y=x^, 3/ = l y =x(l -\-x*) 
and t/ = (a; + 1 ) (a: + 2) (a; + 3) are all included. 


18. How many parameters are needed to describe the general cubic 
functional relation between x and yl Show that the general expression 
includes such simple functions as 


1 _ a; + l 


and t/* =x - 


1 

X 


19. Show that the function type represented by j/= where a, 6, o 

cx -j-a 

and d are parameters, is a particular case of the general implicit quadratic 
function. Deduce that this type is such that both inverse functions are 
single-valued and of the same type. 


20. Find the values denoted by /( - 1), /( -J), /(O), /(i) and /(h), when 

* i iC — 1 1 

(a) f(x) —2x + 3f (6) / (x) = r and (c) f{x) = 2x* - 4a5 -f 3 + - . Explain why 

ZX + 1 . X 

/( - i) cannot be defined in the second case and /(O) in the last case. 

21. If f{x) * express /(2a) m terms of a.^ 
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22. Show that, for the function f{x) = 1 - 2x* + 3a;*, we have /( - a) =/(a) 
for all values of a. Indicate that this is true of any polynomial containing 
only even powers of x, 

Fmther, if /(a;) =2a; + 6a;* - a;®, show that/( - a) = -f{a) for any value of a. 
For what general pol 3 momial function of x is this relation valid? 


23. Find the value of f(-^ in terms of o, when /(a;) =a;* + 3a; - 2 and 
(2x — l)(x-2) 

when /(a?) = — - — r — . In the latter case, show that / ( - ) for any 

1 — JLx 4" a;* \o/ 

value of a. 


24. Find the value of /(a +/i) in terms of a and h when f(x) =- , f(x) =a;*, 

^ 2 ^ 

f(x) = 1 - 2a;* and f(x) = . Deduce an expression for the “ incrementary 

ratio ’• ^ 

f{a + h) -f(a) 
h 

in terms of a and h in each case. 


25. Find i/ as an explicit function of t when t/ =x* + 3a; - 2 where x = 1 + / 
and when y = 1 - x* where x =— — - . 

JL 


26. Obtain a relation between x and y when it is given that x = <* and 
y = 2t. Plot a graph of the relation by giving various values to t. 

27. The variables x and y are both given functions of t and, by eliminating 

an implicit function relating x and y is found. Obtain the relation in the 

following oases : 

(») *=<(<+2), y=<(<-i). 

28. A wooden letter tray is in the form of an open rectangular box, of which 
the base sides are respectively twice and three times the depth. Express the 
volume of the tray and the area of wood used as functions of the variable 
depth of the tray. Deduce a relation between the volume and area of wood. 
What is the volume if 4 square feet of wood are used in the construction of the 
tray? 

The solution of equations 

29. By writing factors, find the roots of each of the equations x* - 3x -f 2 = 0, 
2x* + 6x + 2=0, 6x*+x--2=0, x* + 6x* + llx 4-6 =0 and 2x* - 6x* +x 4- 2 =0. 
In the first three cases, check your results by means of the formula for the 
general solution of a quadratic equation. 

80. From the general formula, solve the qufidratio equations x* 4- 2x - 3 = 0, 
x* + 2x 4-3=0 and 3x*--7x-3=0. 

31. A beam has a rectangular cross-section of sides x and (x -i- a) feet €ind a 

volume of h cubic feet. If the length of the beam is c feet, show that algebra 
indicat(9S that there are two possible values of x, one of which does not “ fit ** 
the problem. Find an expression for the solution in teisns of the parameters 
a, b and c. * 

32. Show that each of the cubic equations x* 4- 6x* 4- 4x - 4 = 0 , x* •f 1 = 0 , 

and 2x* 4-x* - 9x - 2 = 0 has one integral root. Hence, complete the solution 
of ^ach equation. • ^ 
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33. By graphical methods, find the number and the approximate values of 
the roots of each of the equations : 

(a) (6) x»-2a?-6=0, (c) -3a?* - 1 =0. 

34. Verify the approximate solutions of the previous example by plotting 
graphs of 

(a) y and y =a?*+a? -2, (6) y =a?* and y = 2a: + 6, (c) y =a:* and y ^ . 

X X — o 

35. Show graphically that the equation a?* -7a? + 7=0 has two positive 
roots almost equal in value, and evaluate them approximately. 

36. Find the approximate values of the roots of a?* - 12a? + 7 = 0 and of the 
negative root of a?* - 12a?* + 12a? -3=0. 

37. By considering the forms of the general curves representing 

y =aia?* +a,a?* +a^ +04 and y—~ 

where the coefficients are constants, show that the general quartic equation 
must have either four, or two, or no real roots. 

38. By algebraic methods, solve each of the following pairs of simultaneous 
equations: 3a? + 6y = ll and 6a? -30y + 41 =0 ; 2a?-y + 3=0 and a?y=2; 
»-2y + l=0 and a?*+y*=9. 

39. Find the solution of the general pair of linear equations 

+6 i 2/ +Ci =0 and OjO? + 6|y + Cj = 0. 

Show that the solution fails only when — = ^ . Illustrate this case of failure 

a, 6a 

by attempting to solve the equations 2a? - y + 3 = 0 and 4a? - 2y + 6 = 0. What 
is the graphical reason for the case of failure ? 

40. How many solutions are to be expected in each of the types of simul- 
taneous equations : (a) a:y = 1 and aa? + 6y +c = 0, (6) a?y = 1 and y =aa?* +6a? +c? 
Find, by graphical methods, the approximate solutions of 

a?y = l and y=5a?-2a?*. 

41. The prices per bushel of wheat and rye are Pi and p, respectively. The 
m€uket demand for wheat is given by a?i =4 - lOpj +7p, and for rye by 
a?, = 3 + 7pi - Sp,. The supply of wheat is related to the prices by the relation 
a?i = 7 +Pj -p, and the supply of rye by the relation a:, = - 27 -pi + 2p4. Find 
the pair of prices which equate demand and supply both for wheat and 
for rye. 

42. In the problem of the previous example, a tax of ti per bushel is im- 

posed on wheat producers and a tax of per bushel on rye producers. Find 
the new prices for the equation of demand and supply (substituting pi - ti for 
Pi and Pj - tt for p, in the supply relations above). Show that the wheat price 
increases by an amount A - ^i) and the rye price by an amoimt A ( 14/t - 3<i). 
Deduce J^ha^ ** 

(а) a tax on \^hea^ alone reduces both prices ; 

(б) a tax on rye alone increases both prices, the increase in the rye price 

' being greater than the tax. 

(See Hotelling : Edgeworth's Taxation Paradox^ Journal of Political 
Economy, 1932, especially pp. 602-3.) , c 



CHAPTER III 

ELEMENTARY ANALYTICAL GEOMETRY 


3.1 Introduction. 

We propose, in the present chapter, to follow up a line of develop- 
ment already indicated (2.7 above) and examine, in some detail, the 
relation between function types and curve classes. Our investi- 
gations belong to what is termed “analytical geometry*' and our 
method is to take certain well-known classes of curves in turn, deter- 
mining what type of function corresponds to each class. We have 
then an analytical method of treating the geometrical properties of 
the curves and a diagrammatic method of illustrating analytical 
properties of the functions. 

Three simple, but essential, formulae must first be established. 
Two fixed points have co-ordinates P{x^y t/i) and Q[x 2 , t/ 2 ) referred 
to rectangular axes fixed in a plane Oxy, Then : 


(1) The distance between P and Q is sl{x 2 - Xj)^ + (y, - 


(2) The mid-point between P and 
Q has co-ordinates 

\ 2 ’ 2 /* 

(3) The point which divides PQ 
in the ratio p : q has co-ordinates 

/ qxi -hpx2 ^ qyi+^\ , 

The proofs of these formulae, in- 
volving quite simple geometrical 
notions, make use of the notation 
of Fig. 16 and apply no mat 
aqd Q take up in tjjie plane. 
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(1) Pythagoras’ Theorem, applied to the right-angled triangle 
■P^ifrgives 

+£Q^ = (X, - + (y, - !/,)». 

since PK —LM = OM - OL =X 2 -Xi, 

and KQ=MQ-MK=MQ-LP=y 2 -yv 


Hence, PQ = ‘J {x^ - x^)^ + (y^ - j/i)*. q.b.d. 

(2) If R is the mid-point of PQ, then a well-known property of 
parallel lines tells us that N is the mid-point of LM, i.e. LN =NM. 
If the co-ordinates of R are (x, y), then 

LN=ON -OL=x-Xi and NM =0M -ON =X 2 -x. 

So X-Xi = X 2 -X, 


i.e. 2 x=Xi+X 2 . 

Hence, x = ^ - ^ — * , and similarly 


(3) If 1? divides PQ in the given ratio 'p : g, then 


LN PR _p 
NM~~RQ~'q" 

and, if the co-ordinates of R are (x, y) so that 

LN =x - Xi and NM =Xg - x, 


then 

Simplifying, we obtain 
and similarly 


x-Xi_p 

X 2 -x~ q 

-^py2 

p+q 


Q.E.D. 


Formula (2) is a particular case of formula (3) with p=qz=i. It 
is interesting to notice the connection between these results and the 
notion of an average. The mid-point between P and Q is a kind of 
simple average point and each of its co-ordinates is the simple 
average of the corresponding co-ordinates of P and Q, In the same 
way, the point R dividing PQ in the ratio p : g is in the nature of a 
weighted average point between P and Q, Each of its co-ord!inates‘ 
is the weighted average of the corresponding co-ordinates of P and 
Q, P’s co-ordinates being weighted with q and Q’s with p. In fact, 
the larger is q relative to p, the nearer is R to P, and the larger is the 
weight of P’s co-ordinates in those of P. * 
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The co-ordinates of the origin 0 are (0, 0). If P (x, y) is any fixed 
point, the following are particular cases of the above results : 

(1) The distance OP^'Jx^ 

(2) The mid-point of OP has co-ordinates \y). 

(3) The point dividing OP in the ratio p : q has co-ordinates 

( py \ . 

\p-\-q' p +g/ 

3.2 The gradient of a straight line. 

The simplest curve class is the class of straight lines.* A straight 
line is fixed if two points on it are specified. We can, therefore, 
speak of the straight line PQy P and Q representing two fixed points 
on the line, but we must always remember that the straight line 
extends indefinitely in both directions. 



A most important property of a given straight line is its direction 
relative to a pair of fixed axes Ox and Oy, defined by means of the 
angle a (e.g. in degrees) that the line makes with the positive 
direction of the horizontal axis Ox, If the angle a is acute, the line 
slopes upwards from left to right ; if a is obtuse, the line slopes down- 
wards from left to right. The angular measure of the direction of 
a line, however, is not metrical. It is, therefore, inconvenient 
for mbst analytical purposes and a metrical indicator of, direction is 
needed. Such an indicator is readily provided^ by making precise 

♦ A straight line is a locus of points in a plane and thus a curve. • It can, 
however, be regarded as the “ limiting ” case of a curve which does not 
“*curve ** at all. » ^ 
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our everyday notion of a “ gradient We say, for example, that a 
railway track has a gradient of I in 10 if it rises one yard vertically 
for every ten yards horizontally. With the notation of Fig. 16, 
the straight line PQ ‘‘rises” a distance NQ over a “horizontal” 


distance PN. 


The ratio measures the amount of the rise per 


unit horizontal distance and can be called the gradient of the line. 
Hence : 


Definition : The gradient of the straight hne PQ referred to the 
axis Ox is the ratio of NQ to PN. 


Here P and Q are any two points on the straight line and N is 
the point where the parallel to Ox through P cuts the parallel to 
Oy through Q* 

It is clearly essential that the gradient should depend only on the 

direction of the straight line 
and not on the positions of 
the points P and Q selected to 
express it. This point needs 
investigation. Let P' and Q' 
be any other pair of points 
on the given line. Then, from 
Fig. 17, PQN and P'Q^W are 
similar triangles, and a well- 
known geometrical property of 
such triangles gives 
NQ NV' 

PN P'N'' 

The ratio is thus the same for all selections of P and Q, Exactly the 
same result follows if P' and Q' are any points on a line parallel to 
PQ. Hence, the gradient of a line depends only on its direction and 
all parallel lines have the same gradient. 

It must be stressed that the lengths NQ and PN of the gradient 
ratio must b© given signs according to the usual conventions.*' It is 
easily seen tha^ the gradient of a line sloping upwards from left to 

* Thje' gradient is sometimes referred to as the “ slope ” of the line. If oe is 
the angle the line makes with the axis then the gradient is the trigono- 
metric tangent of a, i.e. gradient = tan at. ( 
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' right is positive and that the gradient of a downward sloping line 
is negative. In the first diagram of Fig. 16, where the line slopes 
upwards, NQ and PN are both positive and the gradient is positive. 
Jn the second diagram, where the line slopes downwards, NQ is 
negative and PN positive, i.e. the gradient is negative. 

Finally, we can show how the magnitude of the gradient indicates 
the steepness of the line. Since aU parallel lines have the same 
gradient, we can draw lines through 
0 for convenience and Fig. 18 shows a 
number of such lines sloping upwards 
with increasing steepness. The points 
Pi, Pg, P3, ... are taken on the lines 
with the same abscissa OM, Then 

ifPi AfPa AfPg 

i.e. the gradient of the line increases 
as the line becomes steeper. In the 
same way, if a line slopes downwards, 
its gradient is negative but increases 
numerically as the line becomes steeper. The following results are 
thus established : 

(1) The gradient of a line is a metrical indicator of its direction, all 
^ parallel lines having the same gradient. 

^ (2) A line sloping upwards from left to right has a positive gradient, 
and a downward sloping line a negative gradient. 

. (3) The steeper the line, the larger is the numerical value of its 
gradient. 

Two limiting cases arise when a given line is parallel to one or 
other of the axes of reference. In Fig. 18, as the line becomes less 
steep and tends to coincide with Ox, the gradient decreases and tends 
to zero. Again, as the line becomes more steep and tends to coincide 
with Oy, the gradient increases indefinitely.* Hence, the gradient of 
a line parallel to Ox is zero and the gradient of a line parallel to Oy is 
indefinitely large, • 

The angle between two given straight lines is independent of the 

♦ In the terminology of the following chapter, the grewlient “ tends to 
infinity **, and the gr^lient of a Jpe pcuraUel to Oy is “ ia&iite 
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positions of the lines as long as the gradients aiQ fixed. In particular, 
the conditions that two fines are parallel or perpendicular involve 
only the gradients of the fines. If and are the gradients, then 
the lines are parallel if and perpendicular if mixm 2 = -1. 

The first of these results needs no further proof ; we have seen that 
parallel fines have the same gradient. The proof of the second result 
proceeds : 

In Fig. 19, P and Q are two points with the same abscissa OM on 
two perpendicular fines drawn, for convenience, through the origin. 
The triangles OMP and QMO are similar and, since MP and MQ are 
necessarily lengths of opposite sign, the ratio 
of MP to OM equals minus the ratio of OM 
to MQ. Hence, 



MP MQ_ 

om'"om~ ^ 


or X m 2 = - 1. 


The condition can also be written 




1 


m. 


or = -- 


1 


i.e. perpendicular lines are such that the 
gradient of one is minus the reciprocal of 
the gradient of the other. 


3.3 The equation of a straight line. 

A straight fine is fixed in position if two things, e.g. two points on 
the fine or one point on the fine and the gradient, are known about 
it. Our problem now is to find the equation of the fine referred 
to some selected axes of reference, i.e. the relation between the 
^ co-ordinates of a variable point on the fine. 

A fine' passes through the fixed point P with co-ordinates (Xj, 
and its gradient is known to be m. Then, from Fig. 16, 

NQ MQ-LP y-y, 
PN’~OM^OL'^x^x^' 

for any position of a variable point Q (x, y) on the fine. Hence, 

The equation of the straight line with gradient m passing through 
the fixed point is 
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Next, suppose that the line with equation (1) also passes through 
a second fixed point with co-ordinates (Xg, 2/2)* These co-ordinates 
must, therefore, satisfy the equation, i.e. equation (1) holds when 
X 2 is substituted for x and 1/2 for y : 

Pa-yi=m(Xa-Xi) or 

Hence, 


The gradient of the straight line passing through the two fixed 
points (Xi, 1/1) and (xg, j/2) is (2/2 “J/i)/(^2 equation of 

the line is 


y-yi = 


y2-yi 

X2 -Xi 


(X-Xi). 


( 2 ) 


The equation (2) can also be shown to be algebraically equivalent to 
the alternative form 


y-y2=~~(^-^2)- 

X2 Xj 

Attention is drawn to the values m, Xj and ^ appearing in the 
equation (1), and to the values x^, Xg, yi and yg ^ equation (2). 
For any given line, these values must be fixed but, by varying the 
values, different lines are obtained. The values are, in fact, para- 
meters and the equation (1) or (2) is a function type representing the 
whole class of straight lines. Various sub-classes of the complete 
class of lines are obtained by varying the parameters in defined 
ways. For example, if the point (Xj, y^) is kept fixed and the para- 
meter m varied, the equation (1) represents a set of lines of varying 
gradient all passing through a fixed point, a set technically known 
as a “ pencil of lines. Again, if m is fixed and the point (x^, j/j) 
varied, we obtain a set of parallel lines with a given gradient m. 

The equation of the straight line, in form (1) or (2), is seen to bo 
of the “ linear ” type, i.e. x and y appear only to the first degree. 
The converse is also true and any linear relation between x and y 
represents a straight line. The relation 

ax+6y-i-c = 0 (3) 

can be* written* 



* This arrangement of the relation holds, provided that 6^s0. When 6 = 0 
have a straight lii» parallel t^) Oy (see below). 
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On comparison with the equation (1), we see that (3) must represent 

a straight line of gradient - ^ and passing through ^ • So, 


The equation ax +by always represents a straight line and 

the equation of any line can be written in this form. 


There is, therefore, a perfect correspondence between the class of 
straight lines and the linear functional relation. 

We have shown, incidentally, how the gradient of a straight line 
can be determined from the equation of the line. The gradient of 

the line with equation (3) is given by - i.e. the gradient is minus 

the ratio of the coeJBScients of x and y when the equation is written 
with both terms on the same side. It follows that 

4-6iy +Ci = 0 and a^+b^y+c^^^O 


represent parallel lines if ~ ~ 

a-J)^ ^ 2^1 ~ ^ 

.nd perpendic«l„ li„« if ( - S) ( - ?-■) = - 1, U, if 

Ojag +6162 = 0. 




For completeness, we must examine the cases where a line is 
parallel to one or other of the axes. It is easily seen that the equation 
of a line parallel to Ox is of the form y = constant, and the equation of a 
line parallel to Oy is of the form x=^constant. This follows since the 
y-co-ordinate of any point on a line parallel to Ox has the same 
value as the y-co-ordinate of any other point on the line, and simi- 
larly for the a;-co-ordinates when the line is parallel to Oy. The 
results can also be derived from the equation (1). A line parallel to 
Ox has gradient m equal to zero and so its equation is 


or y=yi = constant. 

A line paraHel to Oji has an indefinitely large gradient. The equation 
(1) can be wiititen in the form 

m 
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and, when the value of m is increased indefinitely, the equation tends 
to assume the form 

z-Xi = 0 or a; =a;i= constant, 

which is the equation of a line parallel to Oy. Notice, also, that the 
equation (3) represents a line parallel to Ox when a = 0 and a line 
parallel to Oy when 6 = 0. The axes themselves are straight lines 
with equations y = 0 (the a:-axis) and a; = 0 (the y-axis). 

Finally, the equation of a straight line passing through the origin 
with gradient m has equation y—mx. So, the equation (3) represents 
a line through the origin if c = 0, when it appears in the form 

ax +6i/ = 0. 


3.4 The parabola. 

I A parabola is a curve defined as the locus of a point which is 
^ equidistant from a given point S and a given line d in a plane. The 
V point S is called the the line d is called the directrix and the 



line KS perpendicular from 5 to d is called the axis of the parabola. 
The distance of S from d (SK) is denoted by 2a, where a is some given 
constant. The general form of the parabola is shown in Fig. 20. The 
curve is symmetrical about its axis KS and must pass through the 
mid-point of KSy called the vertex V of the parabola. 

A vihole set of different parabolas are obtained by varying the 
positions of S and d ; the general shapes of all parabolas* are similar 
but the particular position and shape of a parabola depend on the 
location of S and d. The most important thing distinguishihg one 
parabola from ano(^er is the distance of S from d, i.e. the value of 
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/ 

the parameter a. If a is small, then S is close to d and the parabola 
is elongated as shown in the first curve of Fig. 20. If a is large, then 
S is distant from d and the parabola is flat as shown in the second 
curve of Fig. 20. 

The equation of a given parabola can be obtained when the axes 
of reference are selected. We propose, here, to select the axes in such 
a way that the axis of the parabola is vertical, i.e. parallel to Oy. 

The position of the origin is 
still at choice and it is found 
that the simplest result is 
obtained when the origin is 
taken at the vertex of the 
parabola. In this case, the 
equation of the parabola is 
derived as follows. 

Take axes of reference as 
shown in Fig. 21 and suppose 
that the focus S is above the 
vertex of the parabola. The 
point S has co-ordinates (0, a) and the line d is parallel to Ox and a 
distance a below it. If P is any point (x, y) on the parabola, then 
the definition of the curve gives SP=^MP, But 

SP = »Jx^ +(t/ - a)* and MP =NP +MN =y -hot. 



So, x*-h{y- a)2 = (y 4-a)2 

and this, on reduction, gives the equation of the curve in the form 


' On the other hand, if S is below the vertex, the co-ordinates of S 
are (0, - a) and d is parallel to Ox and a distance a above it. An 
argument exactly similar to that given above then shows that the 
equation of the parabola is 



The following .result is thus established : 


The" equation of a parabola with axis vertical and vertex at the 
origin of co-ordinates is y=:ax^ where a is some constant. 
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The sign of the constant a indicates the position of the parabola. 
If a is positive, the axis of the parabola points upwards (as in Fig. 
20 ) ; if a is negative, the axis of the parabola points downwards. 
The numerical value of a is the reciprocal of twice the distance of S 
from d. If a is large, then S is near d and the parabola is elongated ; 
if a is small, S is distant from d and the parabola is flat. 

Suppose, now, that the axes are selected so that the vertex of 
the parabola has co-ordinates (^, 77 ), the axis of the parabola being 
parallel to Oy as before. Then, if S lies above the vertex, the co- 
ordinates of S are (^, 7 ; +a) and the directrix d is parallel to Ox and 
a distance (17 - a) above it. If P is any point {x, y) on the parabola, 
the condition that P is equidistant from S and d gives 

s/(x - +(y -r) - a)^ =y - {t) - cc). 

On squaring each side of this equation and expanding, we obtain 
{x-^Y +{y - t]Y - 2x{y -rj) +a*=:(i/ - tj)» + 2 a(y - tj) +a*, 

i.e. 

Similarly, if S lies below the vertex, the equation is 

(!,-,)= -l(x-£).. 

So, 

The equation of a parabola with axis vertical and vertex at the 
point 77) is {y -7])=a(x-^Y, 

The constant a is to be interpreted, in sign and magnitude, as 
before. The only difference between this general form of the equation 
of a vertical parabola and the previous simpler form lies in the sub- 
stitution of {x - for X and (y - 77) for y. When | =77 = 0, the vertex 
is at the origin and the general case reduces to the special case. 

The most important feature of the general equation of a vertical 
parabola is that it gives y as a quadratic function of x. The converse 
of this is also true. The general quadratic function 
* y=^ax^-\-bX’\-c ^ 

can be written (by completing the square in a;) in the* form 

/ bV b^-^ac 
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/ b*-4ac\ ( b\* 

Comparing this with the equation of the parabola above, we deduce i 

The equation y=az^ -{■bx-\-c, for any values of the constants a, 6 
and c{a^O), represents a parabola with axis vertical. 

The constant a is to be interpreted exactly as before and the vertex 
of the parabola is at the point with co-ordinates 

/ 6 6^ - 4ac\ 

\ 4a ) ‘ 

There is, therefore, a correspondence between the quadratic function 
type and the class of parabolas with axis vertical.* 

3.5 The rectangular hyperbola. 

A rectangular hyperbola is a curve which can be defined as the 
locus of a point the product of whose distances from two fixed per- 
pendicular lines is a positive constant a^. The fixed lines are called 

the asymptotes and their point of 
intersection the centre of the rect- 
angular hyperbola. . A convention 
is required regulating the signs 
to be allotted to the distances 
of a point on the curve from its 
asymptotes. Here, we draw one 
asymptote horizontally and the 
other vertically, taking distances 
above the first asymptote and to 
the right of the second as positive. 
The general form of the curve is 
shown in Fig. 22. There is one 
portion of the curve in the positive 
(or N.E.) quadrant of the plane, and, as a point P moves along this 
portion to the right, the distance PM from the horizontal asymptote 

♦ The quadratic function and its graphical representation have been con- 
sidered in the previdUs chapter (e.g. Ex. 1 of 2.2). From our point of view, 
parabclas are important because they represent such a simple and useful 
function type as the quadratic. 
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decrea43es in the same proportion as the distance PN from the 
vertical asymptote increases. The area of the rectangle ONPM 
remains constant (a*), and the curve approaches but never outs the 
asymptotes. There is a second portion of the curve, exactly similar 
in shape, in the S.W. quadrant. 

A whole set of different rectangular hyperbolas is obtained by 
varying the positions of the asymptotes and the value of the constant 
a*. The general shape of all curves is the same but the particular 
shape of one curve, when the 
asymptotes are fixed, is de- 
termined by the value of 
a*. Fig. 23 (where, for con- 
venience, only the portions 
of the curves in the positive 
quadrant are drawn) shows 
a set of different rectangular 
hyperbolas corresponding to 
the values of a* indicated. 

In obtaining the equation 
of a rectangular hyperbola 
here, we take the co-ordinate 
axes parallel to the asymp- 
totes of the curve, only the 
position of the origin being at choice. The simplest case arises 
when the origin is taken at the centre of the curve, i.e. when the axes 
are the asymptotes. If P is any point {x, y) on the rectangular 
h3q)erbola in this case, then in Fig. 22 

NP X MP — (x}, 

But, NP = OM =x and MP — ON = 

So, xy=oc^. 

The equation of a rectangular hyperbola referred to its asymptotes 
as axeS h xy = where is a constant. ^ 

oc* 

The equation can be written y = — or a;=— , showing that j/ is a 

^ y 

single-valued decreasmg function of x, and conversely. 


y 



Oj 1 2 a 4 a: 

Fig. 23 , 
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In the general case, where the asymptotes are parallel to the axes 
and the centre of the curve is at the point (|, rj), the distances of 
any point (a;, y) on the rectangular hyperbola from the asymptotes 
are seen to be (a; - and (y ~ r)) respectively. The equation of the 
curve is then provided by the condition that 

The equation of a rectangular hyperbola with asymptotes parallel 
to the axes and centre at the point (^, is (a; - {y - r))=oc^. 


This general form is only different from the previous special case 
in that (a; -f) replaces x and {y -r)) replaces j/. When ^=r)=0, the 
centre is at the origin and the special form is obtained. 

The general equation of the rectangular hyperbola in the form just 
obtained is found to give y as the ratio of two linear expressions 
\n X, y being a single-valued decreasing function of x. Similar 
remarks apply to a; as a function of y. The converse is also true. 
The function type 

a^x 4 - 61 

y = -± ^ 

a ^-¥02 

can be written in the form 


{ a ^ +62)2/ - (oiX +61) = 0, 


i.e. 


Oi 


1? 

0/2 




xy - — X y - — 


On factorising, this reduces to the form 

\ 02/ V O 2 / O^ 

Comparing this with the above equation of the rectangular hyperbola, 
we deduce : 


The equation y = ~ for any values of the constants ( 02 ^ 0), 
o^pc -I-O2 

represents a rectangular hyperbola with asymptotes parallerto the 
co-ordinate axes. 

The centre of the curve is at the point ( - — , The class of 

\. 02 * 02 / 
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rectangular hyperbolas, with asymptotes parallel to the axes, is in 
correspondence with a simple algebraic function type.* 

3.6 The circle. 

/ A circle is the locus of a point which moves at a constant distance 
from a fixed point. The fixed point is the centre and the constant 
distance the radius of the circle. The circle is a curve discussed in 
all elementary geometries and needs no description here. It can be 
stressed, however, that there is a whole class of different circles 
obtained by varying the position of the centre and the length of the 
radius. 

A circle is symmetrical about any diameter. The choice of the 
origin, therefore, is more important than the direction of the axes 
in obtaining a simple equation of a given circle. If the origin is 
taken at the centre of the circle, the equation of the circle is found 
(by the method described in 2.3) to be 

a;* 

where a is the radius. This is the simplest equation possible for the 
circle. Similarly, if the centre has co-ordinates (f , ri) referred to the 
selected axes, the equation of the circle is 

This is the general equation of a circle. 

If the squares in this latter equation are expanded, the equation 
of a circle is seen to be a quadratic relation between x and y of such 
form that the coefficients of and are equal and the term in xy 
absent. The converse of this result is also true. By completing the 
squares in both x and t/, the quadratic relation 
x^ ^-y^ -\-ax -\-by - 1-0 = 0 
can be written in the form 

(x + 4 - {y J (a* - 1 - 6 * - 4c). 

♦ Our expression of this result is not quite complete. The function type 
only repvesents a rectangular hyperbola of the form considered if (a,6i - 
is positive. If that quantity is negative, the curve represented by tlie function 
type is of the form {x - i)(y -rf)= - a*. This is a rectangular hyperbola lying 
in opposite quadrants of the plane to that shown in Fig. 22. 

The function type represented by a rectangular hyperbola (i.e. the ratio of 
linear expressions) will be considered particularly in the following chapter (see 
Ez^ 7 of 4.2). ^ 
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The equation thus represents a circle with centre at the point with 
co-ordinates - \b) and with radius J + 6* - 4c. 

This statement, however, needs some qualification. It is only 
when c<\(d^ +6^) that the radius has a definite and positive value. 
If c = +6^), the radius is zero and the circle reduces to a point. 

If c>J(a^+6*), the square of the radius is a negative quantity, 
i.e. no real radius or circle exists at aU. In this last case, the locus 
represented by the equation contains no points whatever. Hence, 

If c<J(a*-f6*), the equation -\-y^ ^-ax -^by — 0 represents a 
definite circle with centre at the point ( - Ja, - \b) and with radius 
— 4c. 

3.7 Curve classes and curve S 3 rstems. 

We can now pass to a number of more general considerations in 
the field of analytical geometry. The curves we have discussed 
belong to particular curve classes, the class of straight lines, of 
parabolas, of rectangular hyperbolas or of circles. In each case, an 
equation has been derived to represent the whole curve class and the 
equation is a particular fimction type involving certain parameters 
Analytical geometry, in fact, is simply the study of curve classes in 
relation to the corresponding function types, and, if a definite curve 
of a certain class is given, we need only substitute the appropriate 
values of the parameters in the corresponding equation or function 
type. Our results can be summarised : 

Curve class General equation 

‘ Straight lines ax + by 

Parabolas (axis vertical) - - - - y = ax^ -\-hx-\-c 

Rectangular hyperbolas (asymptotes parallel ^ 
to the axes) ^ 

Circles a:*-f-y^+aj: + 6y+c=0 

A number of useful comparisons can bo made. The general 
equation of a str^ght fine or rectangular hyperbola gives t/ as a 
single-valued monotonic function of x, and conversely. The general 
equation of a parabola gives y as a single-valued (but not monotonic) 
function of x while a: is a double-valued function of y. The genc^ral 
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equation of a circle gives y as a double-valued function of a;, and con- 
versely. These results correspond to obvious geometrical properties 
of the curves themselves. For example, any parabola is cut by a 
line parallel to its axis in only a single point, whereas it is cut by 
lines perpendicular to its axis either in no point or in two points. 
The number of parameters in the equations of the different curve 
classes is significant. The general equation of a straight line con- 
tains two parameters, the ratios of a, b and c.* This corresponds to 
the fact that two things (e.g. two points on the line) are needed 
to define the position of the line. On the other hand, the general 
equations of the parabola (axis vertical), the rectangular hyperbola 
(axis parallel to the asymptotes) and the circle are function types 
containing three parameters. Three things are required to fix the 
position of one of these curves ; a definite vertical parabola can be 
drawn, for example, to pass through three points known to lie on the 
curve. 

Special sub-classes of a complete curve class can also be con- 
sidered. In a sub-class, the number of parameters in the corre- 
sponding function type or equation is less than the number in the 
complete class, and the most useful sub-classes are those defined 
by a single parameter. A sub-class of curves of this latter kind is 
described as a system of curves. We can define, therefore, various 
systems of straight lines, parabolas, rectangular hyperbolas or 
circles, each system corresponding to a functional equation with one 
parameter. The following examples are instances of some of the 
many ways in which systems of curves can be specified. 

Ex. 1. The two most important systems of straight lines have been 
referred to already (3.3 above). They are the system (or pencil) of straight 
lines through a given point and the system of parallel straight fines with 
a given gradient. 


♦ In the linear equation ox 4-6t/ +c =0, both a and h cannot be zero. If 

o c , 

we can write y = (xx+p where a = - t and p=z if a^O, we can write 
6 c ^ ^ 

X = a'y V P' where a' = - - and /3' = - ~ . There are only two independent 
parameters, either a and j8 or a' and p'. The same reasoning shows that the 
equation of the rectangular hyperbola y = — ~ contains only three inde- 

OffiC "f* 0 j * 

pendent parameters, the ratios of Oj, 6i, o, and 6,. We have only to divide 
nuAierator and denonejnator by (which cannot be zero) to see this. 
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Ex. 2. The system of parabolas with axis vertical and vertex at the 
origin is defined by the equation y=^ax^ with one parameter a. A less 
obvious system of parabolas is given by the equation 

y -h(a - l)y 0<a:<a (Zt - -h 

where A and k are fixed positive numbers (A> Jk) and a is a positive para- 
meter. Any parabola of the system has vertex at a point {h(a - 1), -k) 
lying on a fixed line parallel to Ox, and its focus is a distance \a^ from the 



quadrant is taken. The system of 
where certain curves of the system 
drawn. 


directrix. As the parameter a 
increases, the vertex moves to the 
right and the parabola becomes 
flatter. A parabola of the system 
cuts Ox where 

y — 0 and x^h{a -1) ±ajk. 

It cuts Oxy therefore, at only one 
point, at the end of the range of 
values of x considered, i.e. at the 
point where a; = a (A - Jk) - h. The 
curve also cuts Oy at one point 

where a:=0 and y -k. 

As a increases, the parabola cuts 
the axes at points farther and 
farther from the origin. Notice 
that the range of x is so chosen, 
in each case, that only the part 
of the parabola in the positive 
parabolas is illustrated in Fig. 24, 
obtained when ^ = 10 and Z;=4 are 


Ex. 3. The system of rectangular hyperbolas with asjmaptotes as axes 
is defined by the equation xy^a^ with one parameter a. This system is 
illustrated in Fig. 23. 

A rather different system of rectangular h 3 rperbolas is defined by 

where h and k are fixed positive numbers and a is a parameter. Here 

{x-\-h)(y^k)^ay 

i.e. the rectangular h 3 rperbolas have fixed^ asymptotes parallel to the a^ces 
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and with centre at {—h, —k). The range O^x^^—h limits each curve 

iC 

of the system to the positive quadrant. Certain curves of the system in 
the case A = 2, A = 1 are shown in Fig. 25. 



oj ^--V/ — ^ 

Fig. 26 . 

Ex. 4. The equation represents a system of concentric 

circles, a being a parameter. A different system of circles is defined by 
the equation 

a; + y + V2xy=a, 0<a;<a, 

where a is again the parameter. (The square root is assumed positive.) 
The equation can be written 

J2xy —a -a; -y, 
and, on squaring and collecting 
terms, we obtain 

(a;-a)^ + (y-a)2=a*. 

The circle of the system with 
parameter a thus has centre 
(a, a) and radius a; as the 
parameter a increases the centre 
of the circle moves away from 
O along a line bisecting the angle 
between the axes and the radius 
of the circle increases. From 
the equation, it follows that 
0<x + y<a and, since x is 
limited to the range O^x^a, both x and y are positive and less ^han a. 
This limits each circle of the system to one quarter of its circumference, 
as.shown in Fig. 26*, 
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Any system of curves defined by a single parameter o can be 
represented by the equation 


/(a?, y; a)=0, 

where, for convenience, the parameter is included in the functional 
expression. An important kind of curve system arises when the 
curves do not intersect each other, one and only one curve of the 
system passing through each point of the relevant part of the plane. 
Here, each pair of values of x and y is associated with only one value 
of the parameter a, i.e. the value defined by the curve of the system 
passing through the point (a;, y). The parameter a can, therefore, 
be separated off explicitly and the equation of the curve system can 
be written in the form 

f{x, y)=a. 

Each of the curve systems instanced above is of this type.* In 
particular. Figs. 24, 25 and 26 show non-intersecting curve systems 
in the positive quadrant of the plane and their equations can be 
written 


X +h 


;=:a, (x -\-h)(y ■\-k)=a and x -\-y -i-J^xy^a 


A - Vy 

respectively, i.e. in the form f{x,y)^a. Non-intersecting curve 
systems are of particular interest in economic theory (see 5.7 below). 


3.8 An economic problem in analytical geometry. 

The consumers of a certain commodity are distributed over a geo- 
graphical area which can be considered as a two-dimensional plane. 
The commodity is produced by two firms situated respectively at 
the points and .4a in the plane. The problem is to determine how 
the whole area is divided into two parts by a curve, the consumers of 
one part obtaining their supplies of the commodity from the firm at 
Ai and the consumers in the other part buying from the firm at .4a.t 
The solution of the problem depends on what is assumed about the 
price “ at works charged by each firm, and about the nature of 
the transport costs from factory to consumer. It is assumed, here, 

* If the C7irv% systems of Figs. 24 and 20 were not confined to the positive 
quadrant in the way shown, they would cease to be of the non-intersecting 
type. 

t The solution of this problem, as given here, is based on the work of Prof. 
Schneider. See Bemerhungen zu einer Theorie der Raumivirtachaft, Econo- 

metrica* 1936« ^ 
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that the factory price p per unit of the commodity is the same for 
both firms. Further, it is taken that the transport cost varies 
directly with the distance of the consumer from the factory con- 
cerned, the distance being measured in a straight line. The transport 
cost per mile is, however, different for the two firms, being per unit 
of the commodity for the firm at and ^3 for the firm at To fix 
ideas, we can take so that it is more expensive, other things 

being equal, to buy from the firm at A^. 

If a consumer is miles from A^ and miles from A 2, then he 
pays (p + per unit of the commodity if he buys from A^ and 
(p +^2-52) if his supply comes from 
A 2^ Assuming that consumers 
buy from the firm for which total 
price is cheaper, the boundary 
between the two areas required is 
defined by the points in the plane 
where the total prices for the two 
supplies are equal : 

P +^5i=p -f^2^2 or t^Sj^:=^t2S2^ 

Choose axes of reference as shown 
in Fig. 27 , where the origin 0 is 
taken as the mid-point of A^A2- 

If the firms are 2 a miles apart, Ai is the point ( - a, 0 ) and ^ 2 is 
point (a, 0 ). If P is any point {x, y) on the boundary, then 

5^ = +a )2 and 82=^^ {x -a^ 

and -f-a )2 ^y"^ ~t2^ {x - a)'^ +2/^, 



i.e. t^(x^ -f -\- 2 ax +a^)~t^{x^ - 2 aa; +a^). 

Collecting terms and dividing through by (^2^ - ^i^), we have 

xi+yi-2a~~z+a^ = 0, 

as the equation of the boundary referred to our selected axes. Write 


6 = 




a>a. 


The boundary is then seen to be a circle with centre at (6, 0) and with 
radius The circle has centre beyond ^2 on the line 

and encloses the pointy g as shojm in the figure. On the assumptions 
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we have made, therefore, the firm at A 2 supplies all consumers 
within this definite circle, and the firm at Ai supplies all consumers 
outside the circle. Many other problems of this kind can be devised 
by var3dng the assumptions ; the method of solution is similar in 
all cases. 


EXAMPLES III 
The straight line 

1. Find the distance between the points (1, 2) and ( -2, 1) and the co- 
ordinates of the mid -point between them. Verify your results by drawing an 
£tccurate graph. 

2. Obtain an expression for the distance between the points 2atx) 
and 2a^,), where a, t^ and t^ are constants. 

3. Show that the points (1, 1), ( - 3, - 1) and ( - 4, 1) form a right-angled 
triangle. 

4. Show that the point (2, - 1) forms an isosceles triangle, and the point 
1 - s/3) an equilateral triangle, with the pair of points (1, 2) and ( - 1, 0). 

6. A quadrilateral is formed by the points A (6, 2), B (20, 10), G (3, 8) and 
D ( - 12, 0). Write down the equations of the sides and show that the quad- 
rilateral is a parallelogram. Show that the diagonals AG and BD have the 
same mid-point, i.e. bisect each other. 

6. By considering an isosceles right-angled triangle, show that the gradient 
of a line at 46® to the horizontal is unity. 

7. An equilateral triangle has a side of 2 inches ; show that the perpen- 
dicular from a vertex to the opposite side is of length <^y3 inches. Deduce that 

the gradients of lines making angles 30® and 60® with the horizontal are 
and ^/3 respectively. 

8 . Find the equations of the lines passing through the point (2, 1) (a) with 
a gradient of 2, (6) making an angle of 60® with Ox, 

9. Show that the line joining the points (a, 6) and (ka, kb) passes through 
the origin whatever the values of a, b and k. 

10. Find the equation of the line joining the points (4, 3) and ( ~ 2, - 1) and 
deduce that the point (1, 1) is collinear with these two points. 

11. Show that the line joining the points (-1, -J) and (2, 1) passes 
through the origin, which is one point of trisection. 

12. Find values of a, b and c so that the line oa; +c =0 passes through 
the points (1 , i) and ( - 1, 2). Verify your result by writing down the equation 
of the line joining these points. 

18. Find the gradient of the line 2a: - 3t/ + 1 = 0 and the co-ordinktes of the 
points where it cmts the axes. Hence, describe the simplest method of 
plotting the line on squared paper. 

14. By finding the condition that th» point (a:, y) is equidistant from the 
points (3, 1) and (-1,2), obtain the equation of the line which is the perpen- 
dicular bisector of the line joining these points. » c 
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16. Write down the gradients of the lines x -y + \ —0, a;+f/-}>7=0 and 
2 a? ~ 3 t/ + 1 = 0, and show that they form a right-angled triangle. Plot a graph 
to illustrate this fact. 

16. Obtain the equations of the lines which pass through the point (2, 2) 
and are respectively parallel and perpendicular to the line 2 a?+j/- 3 = 0 . 
Verify graphically. 

17. Find the co-ordinates of the vertices of the triangle formed by the 
lines a; = l, ®~ 32 /~l =0 and a; -f i/ -6 =0. Show that the line joining the 
point (2, 1) to any vertex is perpendicular to the opposite side. What is 
the geometrical meaning of this result? 

18. Show that the lines 3a;+t/“4=0, 05-22/ + 1=0 a; + 61 / - 6 =0 all 
piiss tlirough the same point. Show that the equation of the third line can be 
written in the form (3x + y-4)-A(a;-2y + l)=0 for some value of A. Why 
should we expect this? 

Curves and curve systems 

19. Use the definition of the curve to find tlie equation of the parabola 
with focus at the point ( 2 , 1 ) and directrix 2 /= - i- 

20. Plot a graph of the curve y = 05 *. Mark the position of the focus and 
directrix of this parabola and verify that points on the curve are equidistant 
from focus and directrix. 

21. Locate the focus and the directrix of each of the parabolas 

2 / =05* + 3x - 2, y = 2a;* - 2a; + 1 and y = 6 x - 2x*. 

Compare the parabolas as regards size and position. 

22. By considering the parabolas which represent the fimction, show that 
y -ax* + 6 x +c has a greatest value if a is negative and a least value if a is 
positive. At what value of x do these greatest and least values occur? (Cf. 
Examples II, 14.) 

23. Write down the equation of the rectangular hyperbola with asymptotes 
parallel to the axes, with centre at ( - 1, 2) and with a* = 3. 

24. Find the centre and the asymptotes of the rectangular hyperbola 
xy -2x - y - I = 0. (Cf. Examples II, 9.) 

25. If a point moves so that the difference of its distances from the fixed 
points (o, a) and ( - o, - a) is always 2 a, show that the curve described is the 
rectangular hyperbola xy = Jo*. 

26. If the two fixed points of the previous example are 0 ) and 

( - V'^o, 0 ), show that the equation of the rectangular hyperbola is x* - y* = 0 *. 
Plot a graph of the curve when a = 1 and verify that it is the same curve 
as in the previous example with different axes of reference. Where are the 
•symp totes now? 

27. Wiite down the equation of the circle with centre at the point ( 2 , f) 

and with radius f . Where does the circle cut the axes ? ^ • 

28. Find the co-ordinates of the centre and the radius of tfie circle whose 

equation is x* +y* - 3 x = 0 . . 

29. Show that (x + 1 ) (x - 2 ) +y* - 1 = 0 is the equation of a circle with the 
. fixed points ( - 1 , 1 ) an€ ( 2 , - 1 ) the ends of a iameter. 
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30. From the results of Examples II, 38, deduce the length of the chord 
cut off the line 2a; - 1 / 4- 3 = 0 by the rectangular hyperbola xy = 2, and the 
length of the chord cut off the line x - 22 / + 1 = 0 by the circle x* 4 - 1 /* =9. 

31. Re-examine the problems of Examples II, 40, in the light of the results 
of the present chapter. 

32. Draw a diagram to illustrate the system of curves 2 /=N/a*-x*, where 
0 < 0 ; <a and whore the positive square root is taken. What are the curves ? 

33. If h and k are fixed positive numbers and a a positive parameter, show 

that y=k --i (a; 4 -/ 1 )*, where 0 <a; - h, represents a system of parabolas, 

CL* 

only the parts in the positive quadrant being taken. Draw a graph of certain 
curves of the system when h — 6 and A; = 25, and verify that each curve is 
concave to the origin. Compare with the system of parabolas of Fig. 24, where 
each curve is convex to the origin. 

34. Show that {x -h){y - k) =a, where 0 < a; < ^ , and where h and k are 

fixed positive numbers and a a positive parameter, represents a system of 
rectangular hjq^erbolas confined to the positive quadrant. Taking h — 2 and 
A; = l, plot a graph showing the four curves of the system corresponding to 
a = i. i» 1 and Compare with the curve system of Fig. 26. 

35. Consider the curve system y = {a- x)* where o is a positive parameter 
and O^x^a. Draw a diagram showing the curves for a = l, 2, 3 and 4, 
verifying that the curves are non -intersecting and convex to the origin. 

36. In the problem of 3.8, two firms are 10 miles apart, the price of their 
product at works is the same for each and transport cost per mile per unit of 
the commodity is three times as high for one firm as for the other. Show that 
the former firm supplies an area within a circle of radius 3J miles. Draw a 
graph to illustrate the distribution of the market. 

37. In the problem of 3.8, the two firms are 2c miles apart, the prices at 
works are £pi for one firm and £p^ for the other and transport costs are £t per 
mile per unit of the commodity for each. Show that the curve separating the 
areas supplied by the firms has equation (referred to the same axes as in 3.8) 

T--~~ = l, where a = - " 5*=c*-a*. Plot a graph to illustrate the 

£X* 0* 

distribution when pi=£32, jo, = £40, <=£1 and 2c = 10 miles. 



CHAPTER IV 


LIMITS AND CONTINUITY OF FUNCTIONS 


4.1 The fundamental notion of a limit. 

We are now in a position to begin the promised discussion of the 
concept of the ‘‘ infinite ” so essential for the full appreciation both 
of the power and of the limitations of mathematical analysis. We 
must first go back to the number system itself and elaborate the 
associated notions of order, continuation and limit.* 

Real numbers are capable of arrangement in order of magnitude 
and we can select, from this order, a particular set of numbers 
according to any rule we care to specify. For example, 


1, 2, 3, 4, 6, 6, 

1 2 3 4 5 A 

2> 3> 4> 6> 6> 7> 


are two instances of increasing sequences of real numbers and it is 
clearly possible to quote many other examples. All such sequences 
display the property that the numbers of the sequence can be 
written down one after the other, by the rule of selection, without 
coming to an end. This is the property of indefinite or endless con- 
tinuation. 


Increasing sequences can be divided into two cla^sses. In one class, 
of which the first sequence above is a member, the numbers of the 
sequence get larger and larger without limit as we proceed through 
the order. Given any number we like to name, no matter how large, 
we can always find a number of the sequence which exceeds it. The 
numbers of an increasing sequence of this class are said to tend to 
infinity and here we have the idea of the infinitely large, the endless 

continufition of an order of larger and larger numbers. 

• 

* Much of the detail of the present chapter is, perhaps, unsuitable at a first 
reading. In any case, only general ideas concerning limits and continuity 
need be obtained at first and, after the development of the differential c^culus 
in subsequent chapters, the reader can return profitably to a further study of 
the •present chapter in Conjunction with the examples set at the end. 
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In the other class, of which the second sequence above is a member, 
the numbers increase as we proceed through the order but not with- 
out limit. On the contrary, the particular example above is such 
that the numbers steadily approach nearer and nearer to the value 1, 
without ever attaining this value. We can get a number as close as 
we like to 1 by going far enough through the order. In this case, 
the numbers of the sequence are said to tend to 1, and 1 is the limit y 
or limiting value, of the sequence. Each increasing sequence of this 
second class tends to some dejinite number as a limit and, since the 
interval between any member of the sequence and the limit is sub- 
divided ever more and more finely by later members, we have 
the basis of the idea of the indefinitely small or continuous, an idea 
again dependent on endless continuation.* 

In the same way, we can select decreasing sequences of real numbers, 
i.e. sequences in which the order of magnitude of the numbers is 
reversed. Two examples of decreasing sequences are 

1, 0, ~1, -2, -3, -4,..., 

2 t, t, f, h ••• • 

Again, there are two classes, one class illustrated by the first and 
the other by the second example. In a sequence of the first class, 
the numbers grow indefinitely larger numerically, but through 
negative values instead of positive values. Such a sequence is said 
to tend to minus infinity and we have the idea of the infinitely large 
and negative, of “ minus infinity ’’ as the counterpart to “ infinity 
In the second class of sequence, the numbers tend to a limiting value, 
decreasing to the limit instead of increasing to it. In the second 
sequence above, the numbers decrease and approach the value 1, 
i.e. the sequence tends to 1. 

♦ It may be noticed that we have given no justification for the statement 
that there are only two classes of increasing sequences, those tending to infinity 
and those tending to a finite limit. We have not excluded the possibility of 
other cases of increasing sequences. It is, however, a fundamental property of 
the real number system that cases other than the two hero mentioned do not 
exist. We must assume the result without proof here. In passing, we can 
remark that the same property is not true of the system of rational^numhers 
only. An increasing^sequenco of rationale may tend neither to infinity nor to a 
rational number as limit. This is, in fact, the way in which irrational numbers 
can be defined — ^as the limits of increasing sequences of rationale not tending 
to infinity or to a rational limit. For a discussion of these fundamental points, 
based on the work of Dedekind and Cantor, see Hardy, Pure MathemcUica 
r3rdEd., 1921), pp. 1 31. ^ c 
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Finally, sequences of real numbers can be written down in which 
the order does not proceed according to magnitude at all, sequences 
which are not increasing or decreasing. Any of the three possibilities 
already described may obtain for a general sequence of this kind, 
i.e. the sequence may tend to infinity, to minus infinity or to a 
numerical limit. For example, the sequence 

2> 2f 4> 4 > e» e» ••• 

(obtained by “ crossing two previously quoted sequences) is 
neither increasing nor decreasing, but tends to the limit 1. An 
additional complication is here introduced. The sequence may be 
such that n^one of the three possibilities holds ; the numbers of the 
sequence may ‘^scillate ” in value without displaying any of the 
three regular progressions discussed above. An example of such a 
sequence is 

1, f, 3, f, 5, i ... . 

Here the sequence is said to oscillate without tending to a limit. 

A given sequence of numbers, therefore, can behave in one of four 
ways. The sequence may 

(1) tend to infinity, 

or (2) tend to minus infinity, 

or (3) tend to a numerical limit, 

or (4) oscillate without tending to a limit, 

as we proceed through the order of the sequence. 

All concepts of the infinite in mathematics, whether of the infinitely 
large or of the indefinitely small or continuous, are based on the idea 
of an endless sequence of numbers. In general terms, the infinitely 
large corresponds to a process of multiplication without end, the 
indefinitely small to a process of division without end. The indefin- 
itely large and small are thus closely related (as multiplication is 
to division) and they are merely two aspects of the same essential 
notion of order and continuation. Further, the notions of the 
infinite tire implicit in the real number system itself. The system is 
of infinite extent since the order of the number^ can .be continued 
indefinitely ; it is continuous or indefinitely divisible since we can 
continue indefinitely to insert numbers between any two given 
numbers. In the present chapter, the ideas of a limit, of the infinite, 



88 MATHEMAtiCAL ANALYSIS FOR ECONOMISTS 

and of continuity will be developed. But the development must not 
be allowed to hide completely the fundamental ideas we have 
attempted to set out briefly above. 

4.2 Examples of the limit of a function. 

Our next consideration is the extension of the concept of a limit, 
as described above, to apply to functions. The most convenient way 
of introducing the variety of cases of a limit of a function is by 
considering a number of actual examples. A careful study of the 
examples below is recommended ; they will be found to cover most 
of the possible cases of limits. We can then proceed to more precise 
definitions. 

Ex. 1. y = l--- 

X 

Giving X in turn the sequence of positive integral values and the 
sequence of negative integral values, the corresponding sequences of 
values of y can be obtained from the function as shown below : 


X 

1 

2 

3 

4 

5 

6 

7 

... 

y 

0 

i 

i 

i 

1 


« 

7 

... 


X 

-1 

-2 

-3 

-4 

-6 

-6 

-7 

... 

y 

2 

3 

2 


A 

4 

f 

i 

f 

... 


The curve representing the function, 
a rectangular hyperbola, can be 
graphed from these tables (Fig. 28). 

The tables show four sequences of 
numbers which have been met before. 
The two sequences of the first table 
are associated in such a way that, 
as the a;-sequence increases indef- 
initely through integral values, the 
y-sequence increases and approaches 
the limit I. The variable x is 
actually continuous but it is clear 
that the insertion of further values 
into the a;-8equence makes no essential difference ; the y-sequence always 
increases to the limit I. This idea of a, correspondence between x- and 
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M-sequences is expressed by saying that the function « = 1 - - tends to I as 

X 

X tends to infinity. The graph of the function illustrates this fact. As 
X increases through positive values, the curve shown in the graph rises 
from left to right and approaches the horizontal line drawn at unit 
distance above the axis Ox. 

In the same way, the second table provides a further pair of associated 
sequences, the a;-sequence tending to minus infinity and the f/-sequence 

decreasing to the limit 1. This enables us to say that the function y = 1 

tends to \ as X tends to minus infinity. The graph again illustrates the 
tendency to the limit, the curve falling from right to left and approaching 
the same horizontal line as before as x decreases through negative values. 


Ex. 2. y=a;^+3a;-2. 

The following tables of corresponding values are obtained : 


X 

1 

2 

3 

4 

6 

... X 

-1 

-2 

-3 

-4 

-5 

-6 

... 

y 

2 

8 

16 

26 

38 

y 

~4 

-4 

-2 

2 

8 

16 

... 


It follows that, when x is given an increasing sequence of values tending 
to infinity, the corresponding sequence of values of y also tends to infinity. 
A similar result holds when x is given a decreasing sequence of values tend- 
ing to minus infinity. Hence, we say that the function y + 3a; - 2 tends 

to infinity as x tends to infinity or to minus infinity. Fig. 4 illustrates these 
facts, the curve shown (a parabola) rising indefinitely to the right and left. 

In the same way, the function y =a;^ - 3a; - 3 tends to infinity as x tends 
to infinity. But as x is allotted a decreasing sequence of values tending 
to minus infinity, the corresponding values of y are seen to form a similar 
sequence. Hence, the function y -3x -3 tends to minus infinity as 
x tends to minus infinity. Fig. 12 illustrates this limiting tendency, the 
curve shown falling indefinitely to the left. 

Ex. 3. y=|i. 


The following tables of corresponding values are constructed : 


X 

m- 

±1 

±2 

±3 

±4 

±5 

... X 

±i 



±i 

• 

ii 


y 

3 


i 

A 

A 

y 

3 

12 

27 

48 

76 

• 


and 4ihe graph gbown m Fig. 29 i| obtained. 
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In this case, it is immaterial whether positive or negative values of x 
are taken, the value of y being independent of the sign of x. As x is given 
an indefinitely increasing sequence of values, the corresponding sequence 

3 

of values of y gets smaller and smaller. Hence, the function y—-i tends 

X 

to zero 08 x tends to infinity or minus infinity. The value of y is not defined 
at all when x is zero. But, as x is given a decreasing sequence of values 
tending to zero, the corresponding sequence of values of y is seen to in- 


crease indefinitely. Hence, the function y = -tl tends to infinity as x tends 


to zero. The graph of the function 



illustrates these limits. The ciu*ve 
falls and approaches the axis Ox as 
we move to the right or to the left 
indefinitely, and it rises indefinitely 
and approaches the axis Oy as 
x decreases through smaller and 
smaller values to zero. 

3 

The function 2/ = - » shown graph- 
ically in Fig. 5, behaves in a similar 
way, except in one important par- 
ticular. As X decreases in numerical 
value, the numerical value of y 
increases indefinitely, but y is 
positive when x is positive and 


negative when x is negative. Hence, the function y = - tends to infinity as 

X 

X tends to zero through ‘positive values^ and tends to minus infinity as x 
tends to zero through negative values, ' 

^ A 2x + l 
x-l 

The graph of this function is shown in Fig. 33 below, the curve repre- 
sented being a rectangular hyperbola. It is only necessary to construct 
suitable tables of corresponding values of x and y, particular attention 
being paid to the approach of x to the value 1 (where y is not defined). It 
follows that the function y tends to 2 as x tends to infinity or to minus 
infinity, ^ Further, though not defined when a; = 1 , the functionary tends to 
infinity as x tends to 1 {through values greater than 1) and to minus infinity 
as X tends to 1 {through values less than 1). 

£,. 6 . , = 


X 
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In this case, no value of y corresponds to zero value of x since, if we 
substitute a;=0, the expression for y takes the meaningless form of zero 
divided by zero. The value of y is, however, obtained for any non-zero 
value of X and the following tables can be constructed : 


X 

1 

i 


1 


... X 

-1 

-i 

-i 

-i 

-i 

... 

y 

3 

n 

2i 

2i 


y 

1 

li 

If 

li 

If 

... 


From these tables, and from the corresponding graph of the function 
shown in Fig. 30, it is evident that the value of y approaches the limiting 
value 2 as a; is given a sequence of smaller and smaller values, whether 
positive or negative. Though x can never assume the value zero nor y 

attain the value 2, we can say that the function y=-{(l+a;)^-l} tends 
to 2 as X tends to zero. 


Ex. 6. Consider the step-function of 
Ex. 7 in 2.1 above. At the value a; = 3, 
the corresponding value of y = 6. Further, 
as X approaches the value 3 through values 
smaller than 3, the corresponding value of 
y remains 6. But, as x approaches 3 
through values greater than 3 (e.g. as x 
decreases from 4 to 3), the corresponding 
value of y is fixed at 7. Hence, we must 
say that, for this step-function, y tends to 
the value 1 as x tends to 3 {through values 
greater than 3) and tends to the value ^asx 
tends to 3 (through values less than 3). The 
nature of the graph of the function (Fig. 
illustrates this unusual limiting case. 



8) near the point where x = 3 


4,3 Definition of the limit of a single- valued function. 

The examples of the previous section show that the notion of a 
limit of a function is to be derived from that of the limit of a sequence 
of numb(Jrs. If y is a given single-valued function of x, a sei^uence of 
values can be allotted to x according to some prescnbed rule and the 
limit of the function is then dependent on the behaviour of^the 
corresponding sequence of values of y. The limit is essentially a 
relative concept, the walue ap^oached by the sequence oi y the 
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independent variable x is made to change in some given way. We have, 
therefore, to fill in the two blanks in the stock phrase 

y tends to , as x tends to 

The second blank is at our choice ; the first blank is then determined 
from the behaviour of the function. 

The following definitions include all the cases of the limit of a 
function y=f(x) which is explicit and single-valued.* 

(1) The limits of f{x) as x-^ ±cto . 

For the limit of the function as x tends to infinity, we observe the 
behaviour of the sequence of values of the function corresponding to 
a selected sequence of indefinitely increasing (positive) values of x. 
There are four possibilities : 

(а) The values of f{x) may be positive and becoming larger and 
larger as x increases. The value off{x) can be made to exceed any 
given number, no matter how large, simply by allotting a sufficiently 
large value to x. Here, we say that f(x) tends to infinity as x tend? 
to infinity ”, a phrase which can be symbolised 

f{x)->oo as x-^oo or Lim /(a:) = QO, 

*-►00 

In this case, the curve y—flx) rises indefinitely to the right (the axis 
Ox being horizontal) as shown in curve (a) of Fig. 31. 

(б) The value o{f{x) may be negative and numerically larger and 
larger as x increases. This case is similar to the first and we say that 
“/(x) tends to minus infinity as x tends to infinity ”, the symbolic 
expression for which is 

f{x)-> - 00 as x-^oo or Lim f{x) = - oo . 

*-►00 

The curve y=zf[x) now falls indefinitely to the right (see curve (6) 
of Fig. 31). 

(c) The values of f{x) may form a sequence approaching a definite 
value A as a: increases. The value off(x) can be made to differ from A 
by as little as we please simply by allotting to a; a sufficiently large 
value. Here we say that “/(^) t^iids to the limit A as a; tends to 
infinity ” and we write * 

as a:->oo or Lim /(a;)=A. 

*-►00 

* The definitions given here, though clear enough in a general way, are not 
sufficiently rigorous from a purely analytical point of view. For more precise 
definitions, see Hardy, op. cit., pp. 162-8. # See also Examples IV, 19 anil 20. 
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The curve y=f{x) now approaches nearer and nearer (from one side 
or the other) the fixed horizontal line y=A as we move indefinitely 
far to the right (see curve (c)’ of Fig. 31). 

(d) The values of f(x) may behave in none of the previous three 
ways, oscillating without settling down to any “ trend as x in- 
creases. In this case, the function f(x) has no limiting value as x 
tends to infinity and the curve y=f(x) continues its finite oscillations 
as we proceed indefinitely to the right (see curve (d) of Fig. 31). 





The following examples illustrate this class of limits : 


!--->! 

X 

as 

X->oo ; 

2x + 1^2 

x-l 

as 

a:->oo ; 

x^ -{-3x - 2-^00 

as 

x->oo ; 

x^-3x- 3->oo 

as 

x-^co ; 

o 

t 

as 

a;->oo ; 

?->0 

X 

as 

x->oo . 


The limit of f{x) as x tends to minus infinity is (fefined and denoted 
in similar ways. For example : 

' 1 

j . 1 ►! as ^->- 00 ! as 

X • 
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(2) The limits of f{x) as x-^a. 

The problem of definition here is to trace the behaviour of 
the values of f{x) as the variable x is allotted a sequence of 
values approaching nearer and nearer to the given value a;=a. In 
no c€tse does x ever actually attain the value a;=a in the limiting 
process. 

Suppose, first, that the a;-sequence approaches x=za only through 
values greater than a. The corresponding sequence of values off(x) 
is subject to the same four possibilities as before, i.e. the function 
f{x) can tend to infinity, to minus infinity, to a finite limit A or to no 
limit at all as a; approaches a through values greater than a. The 
symbolic notation here is similar to that adopted above, the phrase 
“ z tends to a through values greater than a being used and written 
“ For example, 

f{x)^X as or Lim /(x)=A 

indicates the case where f{x) tends to the limit A. 

Similar remarks apply when the x-sequence is taken through 
values less than a. Here, x tends to a through values less than a 
(written x->a - ) and we have, for example, 

f{x)->X as or Lim /(x)=A. 

X— ►a— 

Finally, if it is found that/(a:) tends to the same limit as x-^a -f 
Btnd as x-^a - , we say that f{x) tends to this limit as x tends to a 
(written x-^a). So, for example, 

/(a;)->A as x-^a or Lim f{x)=X 

x-^-a 

implies that f{x) tends to the limit A as a; tends to a through values 
both greater arid less than a. This is the most important case in this 
class of limits. If the limits as x-^a + and as x-^a - are different, 
it is not possible to combine them in one notation. The diagrammatic 
significance of limits as x tends to a is shown in Fig 32, which exhibit 
the curve y=f{x) in the neighbourhood of the point x=a. In the 
cases of curves (a) and (6), f(x) has a single limit as a:->a, the limit 
being finite iji one case and infinite in the other. In the cases of 
curves (c) and (d), the limits as x-^a H- and as z-^a - are different, 
both being finite in one case and both being infinite in the 
other. 
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The following examples illustrate the possibilities : 


3 

_ _>oo 

as 

o 

t 

X 

as 

x->o ; 

3 

— ycc 

X 

as 

x-)-0 + ; 

2x+\ 

->GO 

x-l 

as 

x->l + ; 

— 00 

as 

0 

1 

— > — 00 

as 

X — “ . 


The step-function of Ex. 7 of 2.1 above gives 

t/->7 as -f- and as . 

All the cases of Fig. 32 are illustrated by these examples. 




4.4 Limiting and approximate values. 

It is important to point out and emphasise the connection between 
limiting and approximate values. If a sequence of numbers tends 
to a fiaite limit, then, by definition, the numbers of the sequence 
approach nearer and nearer to the limiting valde without ever at- 
taining it. It follows that any number of the sequence is an approxi- 
mate value of the limit, and conversely. Further, the approximation 
beipomes closer the /arther advanced in the sequence is the number 
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taken. Thus, no number of the sequence 

h f> l> if 6» ••• 

is ever equal to the limit 1. But 1 is an approximate value of any 
number of the sequence, and the farther we proceed through the 
sequence the closer is the approximation. 

Similar remarks apply when we come to the case of a limit of a 
function. To fix ideas, suppose the function f{x) tends to a finite 
limit A as a; tends to infinity, i.e. the value of f{x) approaches, but 
never attains, the value A as a; increases indefinitely. It follows that 
A is an approximate value of the function f{x) when x is large and 
the approximation becomes closer as we allot larger values to x. 
For example, since we know that 
2a; 4-1 ^ 

— >2 as a;->oo, 

a;- 1 


it follows that this expression is approximately equal to 2 when the 
value of a; is large. And the larger the value of x taken, the better 
is the approximation. To check this statement, put a; = 100 : 


2a; 4-1 201 


a;- 1 


= - = 20303, 


I 




which is within 0-04 of the limiting value 2. 

The limit of a function is thus an abstract notion in the sense that 
the fimction never attains its limiting value. On the other hand, 
this abstract notion is precise and does correspond to a mathemati- 
cally significant property of the function. One of the main uses of 
the limiting concept is derived from the definition ; th e limitin g v«l^^ 
serves as an appro xima^ v alue of the func tion for large values of x 
(if the li mit is obta ined as x tends to infinity) or for values of x near 
a (if the limit is obtainedTas x tends to a). In short, the precise 
notion of a limit covers the vaguer no tions of approximate values. 

In conclusion, it is hardly necessary to stress the fact that the 
symbol “ oo which appears in the S 3 mabolic expression of certain 
Umits does not stand for a number “ infinity The symbol means 
nothing by itself and must never be detached from its contexts The 
phrase “ x lends to^infinity or its symbolic expression “ a;->Qo ”, 
is to be taken as a whole ; it is simply a convenient way of expressing 
the faci; that the variable x is being allotted a sequence of larger and 
larger yiwi^e values and that the corresp 9 nding variation in the valv.e 
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of the function is under observation. Similarly, when we use the 
phrase f{x) tends to infinity we must remember that the values 
of f{x), however large they may be, are always finite values. The 
development here given of infinite limits involves only finite numbers 
together with the idea of processes which can be applied to finite 
numbers repeatedly and without end. 

4.5 Some properties of limits. 

Two single-valued functions ^(a:) and 0(x) are given and it is 
found that (f>(x)^X and «/r(a:) as a:->oo , where A and fi are finite 
values. The following results can then be established : 

(1) <f>(x) -\-iIj{x)^X-\‘Ijl as z-^oo, 

(2) <!> (x) - ip (z) X - fjL as x->oo, 

(3) <f) (z) , ip (x) XfjL as z-^ao. 

From our definitions of hmits, it appears that these results are 
too obvious to need formal proof. Since the value of p{x) approaches 
nearer and nearer to the value A, and ip (x) nearer and nearer to the 
value /I, the value of (e.g.) the sum of these two functions must 
approach nearer and nearer to the value (X -h/i) as z increases 
indefinitely. Hence, (A -f/x) must be the limit of {(p(x) -i-ip(z)} as 
z tends to infinity. Exactly equivalent results hold for hmits as z 
tends to minus infinity or to a finite value a. Further, the results 
extend, in an obvious way, for cases of sums, differences and pro- 
ducts or quotients involving more than two functions.* 

We are now able to evaluate the limit of a comphcated function, 
making use only of the hmits of the simpler functions included in its 
expression and the general results set out above. The method is 
illustrated by the following examples : 

Ex. 1. The constant value 2 has, of course, the Hmit 2 as a;->-oo . 

3 

The function - ->0 as x-->oo . Hence, by result (1), 

' * 3 

2 + --^2 as *->oo. ' 

X 

* With more strict definitions of limits (see 4.3, footnote), the results 
here given are rigidly proved only with some difficulty. See Hardy, op. cit.^ 
pp.* 125-30. « 
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Ex. 2. 


We know that 


2a; + 1 
a;-l 


-^2 and 



1 as a; 00 . But 


x^+3x-l 2a; + 1 a;--l_2a; + l A IN 
a;(a;-l) a;-l x a;-l \ xJ* 


From result (2), it follows that 


a;^ 4 - 3a; - 1 
x(x -1) 


as a; 00 . 


Ex. 3. 


Since and -- 

a; - 1 x 


>►0 as a; ->oo , the result (3) shows that 


2a; + 1 ^ 

fi-s a;->oo. 

a:(a; -1) 


It is to be noticed that the results do not necessarily hold when 
the limit of either of the functions <f>(x) and f/f(a;) is infinite. If, for 
example, both fimctions tend to infinity as x tends to infinity, then 
we can deduce nothing about the limit of the quotient of the 
functions. Numerator and denominator both increase indefinitely 
'as X increases and the quotient can vary in diverse ways according to 
the forms of the two functions. If <f>(x)=2ijj{x) the quotient has the 
limit 2, while if ^ (a;) = J 0 (a;) the quotient has the limit And there 
are many other possibilities. Care must be taken, therefore, in the 
use of the above results to avoid reaching fallacious conclusions 
about limits of functions made up of simpler functions with infinite 
limits. 


4.6 The continuity of functions. 

The definition of a function is based on the correspondence be- 
tween values of two variables ; to each value of one variable there 
corresponds a definite set of values of the other variable. The 
variables themselves may be continuous (in the sense that the 
number system is continuous) or they may be discontinuous. But 
the definition of the function is quite independent of any idea of 
continuity. The substitution of a value of one variable in tke fimc- 
tion, to give the corresponding values of the second variable, is quite 
independent of the substitution of other values. 

The step we now propose to take is to extend the idea of con- 
tinuity to qualify functions, to distinguish between functions which 
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are continuous and functions which are discontinuous. The distinc- 
tion can be expressed, in a rough preliminary way, by saying that a \ 
function ia-only continuous if the related variables vary continuously 
together. The variables must be separately capable of continuous 
variation. But this is not enough ; the continuous variation of the 
two variables must be synchronised. 

The discussion is again restricted to the case of a single -valued 
function of a continuous variable: y=f(x). The general idea of a 
continuous function applies to the function as a whole and involves 
the continuous variation of as x varies continuously. When we 
attempt to formulate a precise mathematical definition, however, we 
find it necessary to define first the concept of continuity at a pointy 
i.e. at one value of x. Point continuity is mathematically more 
significant than general continuity. 

The first requisite for the continuity of the function at the point 
a;=a is that /(a;) should be defined both for the value x—a and for 
aU values of x near x — a. There are now several possibilities con- 
cerning the variation of f{x) as x approaches x = am relation to the 
value /(a) assumed at x — a. Firstly, f{x) may tend to no limiting 
value or to two different limiting values as x tends to a from one side 
and from the other. Secondly, f(x) may tend to a definite limiting 
value as x tends to a from either side but this limit is different from 
f{a). Thirdly , /(a:) may tend to the value / (a) as x ^ends to a from 
either side. It is only in the third case that the function is described 
as continuous at a;=a. 

In diagrammatic terms, the function y=f(x) is represented by a 
curve in the plane Oxy which is cut by lines parallel to Oy in single 
points. Two conditions are needed if the curve is to be continuous 
at the point where x=a. Firstly, there must be a definite point P 
on the curve at a;=a ; there can be no gap in the curve. Secondly, 
as we move along the curve towards the value a:=a, we must arrive 
at this point P whether we approach from one side or the other ; 
there can be no jump in the curve as we pass through P. A curve is 
continuous at a point x=a, therefore, if there is neither a gap nor a 
jump in the curve at the point. In Fig. 32, the curves (a) is the only 
one which is continuous at x=a. Curves (6) and (d) are ^con- 
tinuous because there is a gap in the curve at the point, and the 
curve (c) because there is a j^mp in the curve. 
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The formal definition of continuity of a function at a point is : 


\ Definition : The function y=:f{x) is continuous at x-a if 
/ kl) f {a) exists and is finite ; 

^ |(2) f{x) exists and is finite at all values of x near x=a m such a 
i way that Lim / {x) — /(a). 


The definition is equivalent to the identity of limit and value of f{x) 
at the point, and it is essential to see the necessity of the two con- 
ditions — xhat/(a;) assumes a definite value /(a) at the point and that 
f{x) tends to the same value as x approaches a from either side. The 
conditions correspond to the diagrammatic conditions that the curve 
yz=zf(x) should have neither gap nor jump at the point in question. 

It is now a simple matter to extend the definition to include con- 
tinuity of a function in general. A function is continuous over a 
range of values of a; if it is continuous at every point of the range. 
lOn the other hand, a function is not continuous if it exhibits at least 
one point of discontinuity in the relevant range. 


4.7 Illustrations of continuity and discontinuity of functions. 

If a function is continuous over a range, it is continuous at every 
point of the range and can be represented by a curve which has no 
gaps and no jumps in the range concerned. This implies, roughly 
speaking, that the curve can be drawn graphically on squared paper 
without taking pencil point from paper. It is important, however, 
to stress the fact that a continuous curve is not necessarily a 
“ smooth ” curve (using the everyday meaning of the term 
“ smooth ”). Smoothness is, in fact, more limited than continuity.* 
In Fig. 33, the curves (a) and (6) are both continuous over the whole 
range. The first of these, the parabola y = ^(x^ -i-3x-2), is smooth 
as w;ell as continuous. The second is the curve drawn from the 
equation j/ = l + - 1)^ and it is continuous since it can be drawn 

without taking pencil from paper. But it is not smooth in the usual 
sense since it has a “ sharp point at the point where x = l. ® 

If a fimction,is discontinuous in a range, it must have at least one 
point of discontinuity where one or other of the conditions for con- 

♦ Smoothness of a curve is a concept connected with the “ derivative ” of 
the function (see 6.6 below). ^ « 
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fcinuity fails to obtain. In diagrammatic terms, the curve represent- 
ing the function has at least one gap or jump somewhere in the range 
concerned. T he most important e xample of a discontinuity arising 
fr om the non-existence of some point on the curve, he. by reMQILjOf 
a ^ap in the cur ve, is that o f an infinite discontinuity. The walue of 
the function y=f{x) at such a point a: = a is infinite in the sense that 
/(a:)->± oo as x->a. The curve ‘‘ goes off to infinity ” as we approach 
the point from either side. This kind of discontinuity is of frequent 
appearance even when we consider quite ordinary algebraic functions. 




Fio. 33. 

The function which represents a rectangular hyperbola with asymp- 
totes parallel to the axes (i.e. the fimction which is the ratio of two 
linear expressions in x) always includes one infinite discontinuity, at 
the point where we have the vertical asymptote of the curve. The 
curve "(c) of Fig. 33 shows a rectangular hyperbola with an infinite 
discontinuity at a; = 1. Infinite discontinuities &iay troublesome 
but they can be allowed for, in mathematical analysis, by the^deyio e 
of infinite limits. 

•The case where the discoiatinuity arises because of a jump in the 



102 MATHEMATICAL ANALYSIS FOR ECONOMISTS 

curve is very different. The curve {d) of Fig. 33, corresponding to 
the step-function of Ex. 7 of 2.1 above, displays six points of dis- 
continuity of the “ jump ” kind. Such a discontinuous function is 
clearly of little service in mathematical analysis, and it cannot be 
represented by a simple analytical formula of the kind we usually 
consider in mathematical work. But, as we shall see in the following 
chapter, it is the kind of function that applies most closely to many 
problems of economics and other sciences. We are here faced with 
the difficulty that the functions most suitable for the description of 
economic phenomena are often those which are most inconvenient 
from the point of view of mathematical analysis. Some way 
out of this difficulty of the discontinuous element in economics 
must be found if the full power of mathematical analysis is to be 
applied. 

4.8 Multi-valued functions. 

Limits and continuity have been discussed above only in relation 
to single-valued functions. It is possible, however, to extend the 
definitions to apply to functions which are multi-valued and given 
in implicit form. Since no essentially new ideas are involved, it is 
not necessary here to consider the extended definitions in detail. A 
brief general account is all that is required. 

An implicit relation between x and y usually gives t/ as a multi- 
valued function of x. The ideas of limits and continuity, as applied 
to such a function, necessitate the separate treatment of each of the 
several values of y that correspond to any given value of x. If the 
function is easily divisible into a number of distinct “ branches each 
of which is single-valued (see 2,4 above), there is Uttle difficulty. 
The limit of each “ branch ’’ of the function, as x tends to infinity 
or to a definite value a, is defined as before and so the limits of 
the whole function are described. Further, each “ branch ” can be 
considered separately from the point of view of continuity. All 
“ branches ** may be continuous or one or more “ branches ** may 
show points of discontinuity. • 

In diagramn^atic ferms, taking the axis Ox as horizontal, the curve 
representing y as a multi-valued function of x shows several ‘‘ sec- 
tions '^in vertical line with each other. Each “ section ’’ corresponds 
to one single-valued “ branch ” of the fqnction. AU sections may lie 
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continuous and fit on to each other in such a way that the whole 
curve can be drawn without taking pencil from paper. On the other 
hand, points of discontinuity (whether of the infinite or “ jump ** 
kind) may appear on one or more sections of the curve. 

To take an actual example, the relation a;® — = 9 defines y as a 

double-valued function of x with the single-valued branches 

y = Jx^-9 and y=-Jx^-9, 

where we can consider x as taking values numerically greater than 3. 
Each branch is found to be continuous at all points. For the first 
branch, y->-oo as a;->±oo. For the second branch, y->-oo as 
x^±co . Fig. 34 shows the graph of the function and illustrates the 
fact that it is a continuous function 
with the specified limiting ten- 
dencies. The curve is an example 
of a rectangular hyperbola but the 
asymptotes of the curve (shown 
by broken lines in Fig. 34) are not 
parallel to the axes of co-ordin- 
ates. The double-valued function 
a;* 4 - 1 / 2 ^ 10 ^ corresponding to the 
circle of Fig. 6 , can be shown to be 
continuous in the same way. 

When a multi-valued function Fig. 34. 

cannot be divided easily into dis- 
tinct “ branches ”, some difficulties emerge. But the ideas of limits 
and continuity still apply in general. We can instance the triple- 
valued function x^ +y^ - 3xy = 0. The graph of this function (Fig. 7) 
indicates that it is continuous, being a looped curve with three 
sections fitting continuously on to each other. 

EXAMPLES IV 
Limits of functions 

1. By giving x an arbitrary sequence of increasing positive values, find 
the limit, as a;-)>oo , of each of the following functions ; 

1 1-35 1 -a;* 2a5*“}-a; + l 1 -f 2a; + 3x* 

2a; 4-1’ 14- a;’ l+2a;’ 6a;’*+a;-2’ l+2a;-6a;* 

Illustrate graphicallj^. Are the«same limits obtained as a; - oo T 




104 


MATHEMATICAL ANALYSIS FOR ECONOMISTS 


2. What are the limits, as , of the functions : 

,,* + 1 2* + l 7a:* +*-2 2a:*+3a: + l 

^*’^*-1* 3x*-4a: + l' 3x* - 4x + 1 ^ 

Noticing that (c) is the sum of (a) and (b), and that (d) is the product of 
(a) and (6), verify the general results that the limit of a sum or product is 
the sum or product of the separate limits. 

a? + 1 

3. Explain why the limit, as x-^oo , of cannot be obtained by 

j 3a? 1 

multiplying the limits of (a? + 1) and . 


4, Show that 


x-lx*-2 l+x-x* 
a?4-2~a? + 2^ a; + 2 


Why cannot the limit, as a:->oo , of bo deduced as the sum of the limits 

of the other two expressions 7 a: + 2 

3a; + 1 3 

5. Show that ^ as a;->oo . What percentage error is involved by 

4x — o 4 

taking } as the value of this expression in x when a; = 10 and when x = 100? 
How large must x be for the expression to differ from } by less than *006 7 

6. Establish the general result that, as , 

a^x + bi 

Otjpc ~{~ b^ 

by writing numerator and denominator as x and x 4- respec- 

tively and by using the general results for the limits of sums, products and 
quotients. Check the validity of the general result by reference to the 
particular cases given in previous examples. 

Cl X '\~b d 

7. Find an expression for the difference between — r-^ and — and show 

O2a;+o, o, 

that it tends to zero as x->oo . Hence deduce the general result of the previous 
example. 

8. By methods similar to those of the previous two examples, show that 

aiX*4-6iX+Ci a, 

— I .*_>—? as x-^co , 

02X^4- O2X+C, Ua 

What modifications are necessary when Oi or aa is zero? Check this result 
by reference to the particular cases of the first two examples above. 

9. If y =aa;* 4-6x 4-c, show that y-^±co as x->oo , How does the sign of a 
affect the result? 

10. If X is positive and y = 2x-34*-, show that y-^co as x-^0 and as 

X 

x-^QO . Verify that these results can be obtained by adding the limits of 
(2x - 3) €ind - evaluateti separately. 


11. A relation between x and y is defined by x=f*, y = 2L Show that 
x-^flo as y-^co , and conversely. Can an exact meaning be attached to the 
statement that x tends to infinity “ faster ” ^6tn y? • • 
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12. Show that -{(1 +a?)* as x-^0. Generalise by finding the limit 

X j 

as af-^0 of the expression - {(a +»)* - a*}. 

13. Show that - T— as x->0, and generalise by finding the 

X Vi -^X J 

limit of - { — as a?->0. 

x\a-\-x aJ 

14. What error is involved by taking - 1 as the value of 
^* = 0- 1 and when a; = 0*01 ? 

15. Verify that -{1 - Vl -«} is equal to the reciprocal of {1 + Vl - a;} as 

® I 

long as x^O, Deduce that, as x-^0, -{1 - Vl By a similar method, 

j ® 

show that -{n/1+x - Vl -x}->l as x->0. 

X 

16. A lead shot is allowed to fall in a vacuum. It is known that the shot 
falls 16x* feet in x seconds. Find an expression for the average nmnber of 
feet fallen per second between the ath and the (a4-x)th second. Find the 
limit of the expression as x->0 and give a meaning to this limit. 

17. P is the point with abscissa a, and Q the point with abscissa (a + x) on 
the curve y=x*. The line through P parallel to Ox cuts the line through Q 
parallel to Oy in the point L, The ratio of LQ to PL is the gradient of the 
chord PQ, Find an expression for this gradient in terms of a and x, evaluate 
its limit as x->0 and explain the meaning of the limit. 

(The results of this and the previous example illustrate problems of funda< 
mental importance in the development of the following chapters.) 

18. A right-angled triangle has two equal sides of 1 inch. One of these 
sides is drawn horizontally and divided into (n + 1 ) equal portions. On each 
of the portions after the first a rectangle is formed with height equal to the 
vertical distance from the left-hand end of the portion to the hypotenuse of 
the triangle. Find an expression for the sum of the rectangle areas and 
evaluate the limit of the sum as n->-QO . What is the meaning of the limiting 
value ? 

19. Show that a more rigid definition of the idea of a limit of a function is 
provided by the following : 

The function f(x) tends to the limit A as x tends to o if, for any selected 
small quantity €, we can find a small interval a-8<x<a-fS such that 
{/(x) - A} is numerically less than € for any x of the interval. 

20. In the same way, examine the more rigid definition ; 

The function /(x ) -►00 as x->oo if, for any quantity b no matter how large, 
we can find a quantity a so that /(x) >b for all values of x >o. 


Hra-} 


when 


Continuity of functions 
1 *“ X 

21. Examine the function y = from the point of View of continuity and 


illustrate graphically. 


22. From the function xt/-l-2x-fy-l=0, find the limit of t/ as x-> 1, and 

the limit of x as What restriction must be added to the statement that 

(/ a continuous fimdtion of x, (vid conversely T 
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23. In the general relation type axy-^-bx +cy -fd =0, show that y is a single- 
valued function of x which is continuous except for an infinite discontinuity 
at one point. Show also that a; is a similar function of y. Relate the result 
to the properties of the rectangular hyperbola. 


24. For what values of x does y=- 


a;-}- 1 


tend to infinity? Indicate 


(a; + 2) (a; + 3) 

the form of the graph of the function and describe its discontinuities. 


25. What are the discontinuities of the function 

2x* -f 3a: -I- 1 . 

y= ^ . . .-t 


3a;* - 4a: + 1 


Illustrate graphically. 

26. Show that the function y- 


aia:*+6ia:-»-Ci x j ^ -x r 

= _L_^ — f never tends to infinity if 
Uja:* + Oga: + c, 

6,*<4a,c*, and tends to infinity for two values of a: if 6j*>4a2C,. Examine 
also the case where 6|*=4a,c,. Interpret in terms of the continuity of the 
function. 


jc* — 1 

27. Show that y — - is continuous except at a: = 1. What is the nature 

of the discontinuity at this point? Show that the function becomes com- 
pletely continuous if the value t/ =2 at a: = 1 is inserted. 


28. Describe the discontinuities of the step-function, illustrating with the 
function defined by the table : 


0<x<4 

4 < x<. 6 

6<a:< 10 

II 

o 

■Bi 

o 

II 


29. The function / (a:) is defined as taking the value x when x is positive and 
the value ( - x) when x is negative. Deduce that the function is continuous 
but has a “ sharp point ” where a: = 0. 

30. If o is a fixed positive number and if n-^oo through integral values, 
show that a”->0 if a< 1 and that a”->oo if o>l. What can be said of the 

case 0 = 1? Deduce the limit of as n->oo for various fixed values of o. 

l+o'* 

81. The function f{x) is defined as taking its value from the limit of 

as* n tends to infinity. Describe the variation of the function for positive 
values of x and illustrate graphically. Show that the function is discontinuous 
at a; = 1. What is the nature of the discontinuity? 

32. Draw the graph of the function 4a:* + * = 36 and show that it repre- 

sents a continuous oval curve. 


33. The relation (1 -a:)* -xy' =0 defines y as a double-valued function of x. 
Obtain the two single-valued “ branches and draw the graph of the function, 
showing that the curve is continuous with a “ sharp point ** at (1, 0)^ WTiat 
are the limits of y as a:->0? 

• • 

34. Show that the curve representing the double- valued function 

i/*=(a:~l)*(a:-2) 

c€«i be*taken as continuous, except that tliore is a point (1, 0) which must be 
regarded as a part of the curve but completely separate from all other parts 
of the curve. • • * 






CHAPTER V 


FUNCTIONS AND DIAGRAMS IN ECONOMIC THEORY 
5.1 Introduction. 

Economics is an analytical study, concerned with the relations that 
exist, or can be assumed to exist, between quantities which are 
numerically measurable. As instances of variable quantities in 
economics, we need refer only to prices, interest rates, incomes, 
costs of production, amounts of goods bought or sold on a market 
and amounts of factors of production employed by a firm or industry. 
Some of these quantities can be measured in physical or “ natural ** 
units, others only in money or ‘‘ value ’’ units. The relevant point 
here is that they are measurable in some units. There are, however, 
other concepts of considerable importance in economics, concepts 
which are ordered magnitudes rather than measurable quantities and 
which can only be represented by “ indicators or ‘‘ index-numbers 
All concepts of “ utility ’’ or “ satisfaction '' are of this nature, as 
are notions of the general level of prices or production. But, for the 
moment, we content ourselves with the fact that economic measur- 
able quantities exist and leave the “ ordinal ’’ concepts for later 
consideration. 

It follows that there can be no doubt that mathematical methods 
are possible in economics and that economic relations are expressible 
by means of mathematical functions. Whether it is helpful to intro- 
duce the mathematical technique is another question and it is not 
proposed to devote any space here to a discussion of this purely 
methodological matter. It is sufficient that mathematical analysis 
is applicable and can be called into service when convenient. 

An important point is that the relations of econoipics, and the 
functions which express them, are usually of unspecified or unknown 
form. It is not often that we can say that an economic fimction is 
(e.g.) of linear or qimdratic form, though it is sometimes convenient 



108 MATHEMA^nCAL ANALYSIS FOR ECONOMISTS 

to assume that it can be approximately represented in one of these 
ways. Even in the most general case, however, the economic con- 
ditions of the problem impose certain limitations on the form of the 
functions. By considering the economic nature of our problem, we 
are usually able to say that the fxmction concerned has the mathe- 
matical property (e.g.) of being single-valued and decreasing, or of 
being represented by a U-shaped curve. And this is enough for the 
profitable use of mathematical analysis. 

Analytical methods, then, are directly applicable to economic^ 
problems. But diagrammatic methods are also of great service. 
Any economic problem capable of symbolic representation can, in 
general, be illustrated with the aid of diagrams. There is a curve 
corresponding to each function we use to interpret a relation between 
two economic variables ; we have demand or cost curves as well as 
demand or cost functions. Diagrams displaying the properties of 
such curves and relating one curve to another are extremely useful. 
At least they throw into prominence just those points we are attempt- 
ing to make in our main argument. 

We can now pass to a consideration of some of the functions and 
curves commonly employed in economic analysis, stressing their 
general forms and giving an indication of the more limited forms to 
be assumed in “ normal ” cases. Each problem must be considered 
on its own merits and allowed to dictate what is or is not to be 
regarded as ‘‘ normal Mathematics is the seiyant of the sciences, 
not the master. The present chapter is divided into two main 
sections. In the first section, we are concerned with those economic 
functions which are expressible directly in explicit and single-valued 
form, with demand and cost functions in particular. In the second 
section, we pass to more complicated cases where the economic 
functions employed appear in implicit, and not necessarily single- 
valued, form. Here we are concerned with function types and 
curve systems (see 3.7 above) as applied in the interpretation of 
“ indiBFerence maps 

• 

5.2 Demand functi5ns and ciures. 

Our first problem is to express, in symbolic and diagrammatic 
form ,* the conditions of demand on a consumers' goods market. We 
limit our discussion to the case where there ^is pure compeUbion 
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amongst consumers, where (to use Professor Frisch’s term) each 
COnsuiher~ acts ImT^ a “ quantity adjuster This case occurs 
when no individual consumer has any direct influence on market 
pricesTfwhicTi are the same for all consumers), each consumer being 
ablFtb choose only the amounts of the various goods he will take at 
the ruling prices. In other cases (e.g. when consumers have some 
direct influence on prices), the construction of the demand relations, 
on the lines described below, automatically breaks down. 

Consider the amount of a definite good X demanded by a market 
consisting of a definite group of consumers. In order to obtain a 
simple reptesentatiorr of the demand for X, we must assume that 

u.^1) the number of consumers in the group, 

v(2) the tastes or preferences of each individual consumer for all 
the goods on the market, 

^(3) the income of each individual consumer, and 
«(4) the prices of all goods other than X itself 

are fixed and known. The amount of X each consumer will take can 
then be considered as uniquely dependent on the price of X ruling 
on the market. By addition, it follows that the total amount of X 
demanded by the market depends uniquely on the market price 
of X. The demand for X can only change if its market price varies. 

This expression of market conditions can be translated at once 
into symbolic form. Let p denote, in definite units, the market 
price of X, and let x denote, in definite units, the amount of X 
demanded by the market. Then a: is a single-valued function of 
p, which can be written, in general, in the symbohc form : 

a; = ^(p) the demand function for X. 

The variables x and p take only positive values. 

The demand function can be represented as a demand curve in a 
plane referred to axes Op and Ox along which prices and demands 
are respectively measured. Since we have taken price as the inde- 
pendent variable, we should, according to the mathematical con- 
vention® draw the axis Op horizontally. Since the days of Marshall, 
however, the economic convention has been otherwise, and the axis 
Op has been taken vertical. We shall here adopt the economic con- 
vention and all the demand curves we use are referred to a vertical 
price axis. ^ • 
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It must be emphasised, at the very outset, that ^e^demand 
relation is a completely static one and does not refer to changes in 
demand over time. The demand function and curve assume a fixed 
situation in which a set of alternative and hypothetical prices are 
given. These prices cannot be taken as those actually ruling on a 
market at successive points of time. 

The demand function and curve can only be specified if the four 
factors given above are known. The form of the function and the 
shape of the curve depend on these factors ; if changes occur in any 
of the factors, then the whole form of tlie demand function and curve 
changes. We can describe this by saying that a change in any 
one of the factors causes a shift in the position of the deinand curve, 
the new demand curve being different, in shape and location, fi;om 
the old in a way dependent on the nature of the change assumed in 
the factor. A shift in demand, corresponding to a change in the 
demand situation as a whole, must be carefully distinguished from a 
move along a given defiiand curve. The former implies a change of 
data and may easily take place over time : the demand curve cai? 
shift over time. .The latter implies a hypothetical and non-temporal 
change in price in a given situation. Another way of putting the 
point is to say that the demand function x==(f> (p) has form depend- 
ent on a number of parameters, e.g. the number of consumers, their 
incomes and tastes and the prices of other goods. The parameters 
are fixed for one demand function ; they assume different values 
only for a shift of deinand, for a change in the whole demand 
relation. 

To take an illustrative case, suppose the market demand curve for 
sugar has been fixed. The population now increases, the consumers’ 
tastes change in favour of sugar, the prices of tea and coffee fall and 
the income distribution changes so that the poor have more and the 
rich less to spend. We can say that the demand for sugar, at any 
one price whatever, is greater now than it was before. The whole 
demand curve for sugar has shifted to the right. 

We must now examine the general nature of the demand function 
and curve* in ^the ll^ht of the economic conditions of demand. The 
first point that arises here concerns the continuity of the variables 
X and p and the demand function itself and it is worth while treating 
this point at some length since our ren^rks apply, with little modifi- 
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cation, to other functions introduced later. It would appear that 
the demand for (e.g.) sugar can only be defined for a limited number 
of isolated prices, for (say) prices at halfpenny intervals^from Id. to 
6d. per lb. Given the relevant data, the demand ‘‘ curve ** for s^ar 
then consists of an isolated set of points in the plane Oxp, The 
adoption of such a discontinuous representation is clearly incon- 
venient for a theoretical and mathematical development of" th^ 
demand problem. Can price be taken, for theoretical purposes, as 
a continuous variable ? There is certainly nothing logically incon- 
sistent in the notion of a continuously variable, price. Further, an 
examination of actual practice shows that prices are less discontinu- 
ous than they appear at first sight. Sugar may be quoted at 2Jd. 
or 3d. per lb. But it is possible to insert intermediate prices by 
quoting sugar at two lbs. for 5^d. ; at three lbs. for 8d. or 8jd. ; at 
five lbs. for Is. Id., Is. l|d., Is. 2d. or Is. 2jd. ; and so on. Hence, 
imless there are strong reasons to the contrary, it can be safely 
assumed that the price in a demand relation is a continuous variable 
taking all positive values. The 
demand is then uniquely deter- 
mined for each price of this 
continuous range. 

But, even when p varies con- 
tinuously, it does not neces- 
sarily follow that X varies 
continuously with p, i.e. that 
the demand function is con- 
tinuous. On the contrary, it 
is reasonable to assume that 
demand varies by a series of 
“ jumps ” as price increases 
continuoMy . Demand may be insensitive to small price changes and 
“ jumps ” only when price has increased by a definite amount. In 
this case, the demand function has ‘‘ jump ” discontinuities at 
certain# prices and is a step-function of the kind already considered. 
The demand curve takes the form illustrated in Fig. 35. ‘ 

It is clearly convenient, for theoretical purposes, to assume that 
the demand function and curve are continuous, that demand Varies 
continuously with ^ice. There is, at least, nothing that violates our 
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logical sense in this assumption. And it is reasonable to suppose that 
any actual demand conditions, for a large market at least, can be 
“ fitted sufficiently well by a continuous demand curve for the 
latter to be assumed. Fig. 35 illustrates a hypothetical fitting of a 
continuous demand curve. It is not proposed, however, to consider 
here the “ fit ** of a continuous demand curve to actual data, even 
if such data can be obtained. We need only notice the convenience 
of the continuous demand function for theoretical purposes and point 
out the limitations of this continuity assumption. We proceed, there- 
fore, on the assumption that demand is a continuous function of a 
continuously variable price. The assumption is made for purposes 
of mathematical convenience ; it can be given up, if necessary, at 
the cost of considerable complications in the theory. 

5.3 Particular demand functions and curves. 

We come now to an assumption of very different nature from that 
of continuity. It is assumed that the larger is the price the smaller 
is the market demand for the good concerned. The validity of this 
assumption must be exammed, m the Ught of actual conditions, in 
each particular case. In some cases of demand, as Marshall has 
pomted out following Giffen, it is certainly not valid. But it is 
surely appropriate to all usual or ‘‘ normal ’’ cases of demand. Our 
main development can thus proceed on this assumption of falling 
demand, and the theory can be modified where necessary to meet 
any exceptional cases that arise. 

The assumption implies that x decreases as p increases, i.e. that 
the demand function is monotonic decreasing. It follows that the 
inverse function is also monotonic decreasing. We can take either 
demand as dependent on price, or conversely : 

^ = p = ^{x). X 

Here, both functions are single-valued, continuous and monotonic 
decreasing. In diagrammatic terms, the demand curve falls con- 
tinuously from left to right and it has the same general appearance 
whether we take Op horizontally or (as we do here) vertically? 

Further Testricti(!/ns on the form of the demand relation can now 
be added according to the particular requirements of the demand 
problem considered. Fig. 36 shows three general demand curves in 
certain hjrpothetic^^nd illustrative cases. The first curve 
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taken to represent the demand of a single family for sugar. ♦ The 
price is measured in pence per lb. and the demand in lbs. per month 
per adult member of the family. No sugar is demanded at prices 
above Is. 3d. per lb. As the price falls, the demand increases slowly 
at first, then rather rapidly and finally (for prices below 2d. per lb.) 
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slowly again. The family never demands more than the “ satura- 
tion ” amount of 17 lbs. per adult per month. The demand curve 
cuts both axes and is first convex and then concave to the price axis. 
The shape of such a family demand curve can vary, of course, over 
a wide range for different families and different goods. The form Dj 
is given merely as a fairly typical example. 

The demand curve is illustrative of the demand of a large 
market for sugar, taken as typical of a good with no closely related 
substitutes. The market demand becomes zero at Is. 9d. per lb., 
no fami^ making any purchases at this or any higher price. The 
demand then increases at an ever increasing rat§ as the price falls 

* An “ individual ” demand curve of this kind can be regarded as the 
limiting case of a “ market demand curve when only one consumer is t/iken. 
A demand curve, in fact, can be deiined for the whole group of consiuners or 
for |iny particular groi^p of them we care to specify. 

■ * 


ll.A. 
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until approximately 80 Mn. lbs. per month are consumed at very low 
prices. The curve cuts both axes and is convex to the price axis at 
all points. As an approximate representation of such a demand, we 
could take a demand curve approaching the axes “ asymptotically ” 
in the manner of the rectangular hyperbola (see 3.5 above). 

Sugar is a commodity which is well defined and clearly demarcated 
from its nearest substitute (say saccharine or jam). But most com- 
mercial commodities are such that there is, in each case, a number 
of closely allied commodities.* The variation can be obtained in 
many ways, e.g. by change of quality or by trade-marks. The 
demand curve is drawn to illustrate the demand for Player’s 
cigarettes, taken as t3q)ical of such a commodity. Assuming that 
the price of competing cigarettes is 6d. per packet of ten (a fixed 
price by our assumptions), the demand for Player’s cigarettes is very 
small at 9d. per packet and increases rapidly as the price falls, 
becoming very large when the price is 6d. per packet or less. The 
demand curve cuts the price axis and, since a small price fall results 
in a large increase in demand, it is nearly parallel to the demand axis 
for most of its course. The curve must cut the demand axis at some 
point, but only when the demand is very large compared with that 
at the price of (say) 6d. per packet. 

It is often convenient to represent a demand law by a definite type 
of function, at least as an approximation for a certain range of prices. 
The demand curve is then of definite class, e.g. a straight hno or a 
parabola. The following function types can be given as examples of 
suitable demand laws : 

(1) = — p=a-bx. (2) x = -6, 

6 ’ ^ ' p-Yc ^ x,+b 

(3) p = ^a- bx. (4) p = {a-bxY. 

1 

(5) X = p—a-bx^. (6) a; = 6p~®-fc, • 

Each of thes*e demand laws satisfies the “ normal ” monotonic de- 

* I.e. commodities of the kind considered in the economic theory of mono- 
polistic ** or “ imperfect ” competition. ^ ^ 
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creasing condition.* The variables x and p take only positive values 
and a, b and c are positive constants in each case. If actual numerical 
demand data are available, it may be possible to fit actual values to 
the constants and to draw a definite graph of the demand curve. 

The demand curve of the first law is a downward sloping straight 
line cutting the price axis at the point A where p=a and with 
gradient (referred to the price axis) numerically equal to 6. If the 
values of a and b are 
changed, then the demand 
line shifts in position. If 
a alone varies, the line 
is transposed parallel to 
itself. If b alone varies, 
the line is rotated about 
the point A on the price 
axis. If both a and b 
vary, the line shifts by 
transposition and rota- 
tion. The second demand 
law is {x +b){p ^c)=a, Fm. 37. 

The demand curve is now 

a rectangular hyperbola with asymptotes parallel to the axes and 
with centre at the point where and - c. The curve is 

restricted, of course, to that part of the rectangular hyperbola in the 
positive quadrant. The demand curve shifts in position when the 
values of a, b and c are varied. Variation in the values of b and c 
alone transposes the curve without change of shape. If b decreases, 
for example, then the curve is shifted bodily to the right. Variation 
in the value of a, however, changes the shape of the demand curve. 
Fig. 37 shows actual cases of linear and hyperbolic demand curves 
drawn from the demand laws : 



x^l6-bp or — 

and x — --20 or n= 

^ p +5 ^ a ; +.20 

In each case, x is the market demand for sugac, in Mif. lbs. per 

month, when the price of sugar is p pence per lb. 

* The last two coses involve exponential and logarithmic expression® (see 

9.8 below). In case (4)^ a; must be limited to the interval 0 ^ a; ^ x • 

• • 0 
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An important instance of the use of parametric constants is thus 
obtained. If a demand relation is known to be of one of the general 
types quoted above, then the position of the demand curve is fixed 
only when the parameters are given definite values. The values a, 
b and c are constants under given demand conditions. But a shift 
in demand results when the parameters are changed in value, the 
nature of the shift depending on the particular change adopted. 
Demand laws such as those given above suffice, not only for the 
description of some given demand situation, but also for the shifting 
of the demand situation over time. 

5.4 Total revenue functions and curves. 

The demand for a good X is represented by the two inverse func- 
tions, x = ^ (p) and p = 0 (r), of the demand law. When the demand 
is X and the price p, the product jB=a:p is called the total revenue 
obtainable from this demand and price. It represents the total 
money revenue of the producers supplying the demand and the total 
money outlay of the consumers composing the demand. Now 

B=p<l>{p)=xil>{x), 

i.e. B can be expressed either as a function of the price or as a function 
of the output (= demand). The latter is the more convenient ex- 
pression in most cases (e.g. in relating demand to cost of production) 
and the function B=^xip{x) is called the total revenue function of the 
given demand law. This function, like the demand function itself, 
assumes pure competition amongst consumers. The function is 
represented by the total revenue curve referred to a horizontal axis 
Ox and a vertical axis OR* The height of the total revenue curve 
.thus measures the total revenue obtainable from the output indicated. 

The total revenue function, being obtained from a continuous 
demand function, must itself be continuous. But it is not possible, 
at this stage, to indicate the “ normal form of the total revenue 
function consequent upon the assumption of a monotonic decreasing 
demand function. It will be seen later that the form of the total 
revenue function depends on the concept of elasticity of demand (see 
10,7 below). . We can simply notice here that R increases or decreases 

♦ The fact that Ox is the obvious axis to take as horizontal for the total 
revenue curve is one of the reasons for taking Ox as horizontal for the demand 
curve itself. 
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with increasing output according as the demand is (in a very general 
sense) elastic or inelastic. By an elastic demand, for example, we 
mean a demand which increases in greater proportion than the 
corresponding decrease in price. In this case, the revenue obtained 
from the demand must increase as the demand increases. 

If the form of the demand law is given, however, it is a simple 
matter to deduce the form of the total revenue function and curve. 
For example, in the case of the linear demand law p=a-bXyWe have 


B=ax -bx^. 

This expression can be written in the form 


p's - 
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total revenue curve is thus a parabola with axis vertical and 
pointing downwards (see 3.4 above). The highest point of the curve 

occurs where ^ = Total revenue increases as output increases at 

CL 

first, reaches a maximum value ^ at the output ^ = ^ then 

decreases as output increases further. A total revenue curve of this 
kind, obtained from a linear demand law, is shown in Fig. 44. * 

The total revenue curve contains no reference to the price of the 
good. But the price can be read off the curve quite easily. If P is 
the point on the curve at output x = OMy then 

R MP 

p = — = = gradient of OP, 

^ X OM ^ 


Price can be regarded, in fact, as average revenuey i.e. as the revenue 
per unit of the output concerned. As such, it is measured by the 
gradient of the line joining the origin to the appropriate point on the 
total revenue curve. Since price decreases as demand increases, the 
line OP becomes less and less steep as we move to the right along 
the total revenue curve. 


5.5 Costf functions and curves. 

The following assumptions are needed to obtain a simple expres- 
sion of the problem of cost of production. A given firm produces a 
single uniform good X with the aid of certain factors of production. 
Some of the factors*are empkiyed in fixed amounts irrespective of 
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the output of the firm, and it is taken that the expenditure on these 
factors is known and fixed. The remaining factors are variable and 
the conditions of their supply are assumed to be known. For 
example, the factors may be obtainable at fixed market prices. 
Further, the technical conditions under which production is carried 
out are taken as known and fixed. Finally, the firm is assumed to 
adjust the employment of the variable factors so that a given output 
z of the good X is obtained at the smallest possible total cost 17. 
Then 77 is determined when x is given and assumes different values 
as a; is changed, i.e. 77 depends uniquely on x : 

n=F(x) the total cost function. 

The variables x and 77 are necessarily restricted to positive values. 
The function is shown by a total cost curve referred to a horizontal 
axis Ox and a vertical axis Oil drawn in a plane. The varying 
height of the curve shows the changing total cost of an increasing 
output. 

The total cost function and curve are static constructions relating 
to various alternative and hypothetical outputs produced imder 
given conditions. They do not describe the variation of cost and 
output over time. The form of the function F depends on such con- 
ditions as the technique of production and the conditions of supply 
of the variable factors of production. The function and curve are 
fixed only for given conditions and are completely changed in form if 
any of the conditions are varied. For example, if the technique of 
production is improved by an invention or if the wages of the labour 
employed fall, then the cost function is changed so that the cost of 
any given output is less than before and the total cost curve shifts 
to a new position below the old curve. 

It should be noticed that the total cost function and curve are 
“ minimal concepts. By varying the employment of the variable 
factors, a given output can be produced at various total costs of 
which only the smallest is used to define the total cost function. In 
diagrammatic terms, the ordinate of a point on the total cOot curve 
represents th^ least cost of producing the output concerned and the 
same output could be produced at larger costs shown by points 
vertically above the point taken on the curve. The positive quadrant 
of the plane OxU is thus divided into tjvo areas by the curve ; points 
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in the area above the curve correspond to total costs at which the 
outputs concerned could be produced while points in the area below 
the curve represent total costs too small for production of the out- 
puts. The total cost curve is the lower boundary of the area repre- 
senting possible cost situations. But, since we usually need consider 
only the lowest cost of any output, we are interested in the cost curve 
rather than in the area above it. 

For purposes of mathematical convenience, the single-valued cost 
function can be taken as continuous, l^rther, under “ normal 
conditions with at least one factor of production fixed, it is assumed 
that there are fixed ‘‘ overhead ’’ costs, incurred even when no out- 
put is produced, and that variable costs increase as output increases.’*' 
The total cost curve, therefore, cuts the vertical axis at a point above 
the origin and rises continuously from left to right. Fig. 38 shows a 



total cost curve of “ normal ’’ form appropriate to a hypothetical 
case of a sugar refinery ; overhead costs are £250 per week and total 
cost increases with output, reaching a value of £1250 per week when 
the weekly output of sugar is 80 tons. 

It is sometimes convenient, at least for approximate represen- 
tations of cost over a limited range of outputs, to take the total cost 

• 

♦ If we consider an industry instead of a firm and take all “the factors of 
production as variable, then overhead costs are zero. Or, we may consider 
only the total variable costs of a firm and neglect overhead costs. In either 
case, the normal form of the total cost curve is similar to that given here 
except that it now star^fe from the^origin. 
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function of some particular form. The following are types of cost 
functions appropriate to the “ normal ” case : 

( 1 ) n=ax-\-b. y ( 2 ) U—ax^ -\-bx -\-c, 

' (3) 77=Van:+6+c. . (4) 77 —ax^ - bx^ ■\-cx +<i. 

(6) /7=ca; — +d. ^ (6) /7=ax*— +d. 

a;+c ' ^ x-vc 

(7) n=ae^^. (8) 77 = xV*+®4-d. 

The parameters a, b, c and d are positive in each case. In ca^se (2), 
the total cost curve is a parabola with axis pointing vertically up- 
wards and with vertex to the left of the origin at 3 ;= Since 

the curve is limited to the positive quadrant, it consists of an arc of 
the parabola rising continuously from left to right. The curve 
graphed in Fig. 46 below represents the cost curve 

77 = ^a;2 4-6a: 4-200, 

where £77 per week is the total cost of an output of x tons of sugar 
per week in a sugar refinery. 

The parameters in cost functions of types such as those above 
enable us to allow for shifts in cost. A cost curve given by (2), for 
example, is fixed if a, 6 and c have given values. As the parameters 
are changed, so the whole cost curve shifts in position corresponding 
to some change in the conditions under which the firm operates. An 
increase in c represents a rise in overhead costs and the cost curve 
shifts upwards without change of form. Changes in a and 6, on the 
other hand, correspond to modifications in variable cost and the 
cost curve is changed in shape as well as in position. 

At any output, the ratio of total cost to output defines average cost 
or cost per unit of output. Denoting average cost by 77 and using 
Fig. 38, we have 

77 MP 4 . r njy 

«• =- = ^ =gradient of OP, 


and average cost is read off the total cost curve as the gradient of the 
line joining the point on the curve to the origin. Since avecage cost 
varies with putprft, we have the average cost function rr:=f{x) where 


and a corresponding average cost curve referred to 

X X 

axes Ox and Oir in & plane. The ^rm of this function is to be 
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obtained from that of the total cost function. For example, if total 
cost is given by the function (2) above, then 

, c 

ft— ax +6 -h- . 

X 

Here 7 r-^<x> as and as x-^oo , i.e. the average cost curve must 
fall as output increases at small outputs and rise at large outputs. 
In the particular case of the sugar refinery already quoted, the 
average cost curve is of the form shown by the curve a.c. of Fig. 46. 
The average cost function, unhke the total cost function, is not 
necessarily monotonic. 

5.6 Other functions and curves in economic theory. 

It is appropriate, in some problems, to represent the conditions 
of supply of a good in a way analogous to those of demand. Suppose, 
for example, that a market consists of individuals who bring fixed 
stocks of various goods to a market-place for exchange amongst 
themselves. If there is pure competition amongst the individuals, 
the market prices (in money terms) of all goods being given, then 
each individual determines by how much he will increase or 
decrease his stock of any one good. If he wishes to increase his 
stock, he forms part of the demand for the good ; if he decreases his 
stock, he contributes to the supply of the good. Then, if all prices 
other than that of the good concerned are fixed, the total demand 
and the total supply of the good are defined, by addition over the 
whole market, as dependent upon the price. The demand and 
supply functions are exactly similar, supply being negative demand, 
and it can be taken that the functions are single-valued, continuous 
and monotonic, demand increasing and supply decreasing as price 
decreases. 

A different representation of demand and supply is appropriate to 
simple problems of international trade.* Two countries A and B 
exchange two goods X and Y, country A producing only X and 
countrjj B only Y. It is assumed that the conditions of production 
and the supply of resources are given in each country. Then country 
A can determine uniquely the amount of X it is willmg to offer in 
exchange for any given amount of F, i.e. the amount of X offered 

• • See Marshall, The Pyg^e Theory of Foreign Trade (1879). 
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is a single-valued function of the amount of Y taken, a function 
which appears as the demand and offer curve of country A (when 
referred to axes Oxy in a plane). A similar function and curve is 
defined for coimtry JS, the good Y being offered and X demanded. 
These curves differ from those previously defined in that they are 
referred to axes along which amounts of two goods are measured. 
The prices of the goods do not appear explicitly. But, if P is any 
point on the demand and offer curve of the country A, then the 
gradient, referred to Oxy of the line OP gives the amount of Y taken 
in exchange per unit of X, i.e. it gives the price of X in terms 
of T. A similar price can be defined, as a ratio of exchange, for the 
country B. 

We assumed, in 5.6 above, that only a single good was produced 
by the given firm. A first approach to a more general problem can 
be made by assuming that a firm produces two goods X and Y undet 
given technical conditions and making use of fixed supplies of certain 
factors of production. The total cost of production is now given 
and the interest lies entirely in the varying amounts of the two 
goods that can be produced. If a given amoxmt x of the good X is 
produced, then the fixed resources of the firm can be adjusted so as to 
produce the largest amount y of the other good Y compatible with 
the given production of X. Here y is a single-valued function of )x 
which can be taken as continuous and monotonic decreasing ; the 
larger the amount of X required, the smaller is the amount of Y 
obtainable. Conversely, if a: is the largest amount of X that can be 
produced jointly with a given amount y of F, then a; is a single- 
valued, continuous and decreasing function of y. The two functions 
must be inverse to each other, i.e. two aspects of a single relation 
between the amounts of X and Y produced, a relation imposed 
by the condition of given resources. In symbols, we can write the 
relation 

y)=0, 

giving y=/(*) and x=g{y), 

where/ and g are single- valued and decreasing functions to ho inter- 
preted in t’he way described. The relation, y) =0, can be called 
the transformation function of the firm and it serves to show the 
alternative productive possibilities of the given supplies of the 
factors. The corresponding transfornijilion curws in the plane day 
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is cut by parallels to either axis in only single points and is downward 
sloping to both Ox and Oy* Further, it can be taken, in the 
“normal ” case, that the production of Y decreases at an increasing 
rate as the production of X is increased, and conversely. The trans- 
formation curve is thus concave to the origin. Fig. 39 shows a 



* normal ” transformation curve in the hypothetical case of a firm 
producing two grades of steel with given supplies of ^bour, raw 
materials and equipment. The transforiliation curve, it should be 
noticed, is a “ maximal concept and forms the outer boundary of 
the productive possibilities. Any point within the curve corresponds 
to productions of X and Y possible with the given resources while 
productions represented by points outside the curve cannot be 
obtained no matter what adjustments of the given resources are 
made. 

The analysis can be generalised to allow for the use of given re- 
sources in the production of diiBFerent goods, not at the same time, 
but in diSerent periods of time. In the simplest problem, it is 
assumed that a firm, with given technical conditions and given pro- 
ductive resources, can arrange production in two years in various 

• 

♦The curve is sometimes called a “production indifference curve”, a 
terminology which is not to be recommended since the curve has little in 
common with the indifference curves described below. The term “ ttans- 
formation curve” is used (e.g.) by Hayek, Utility Analysis and Interest^ 
Economic Journal, 19311 . ^ 
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ways. It is convenient here to take incomes rather^ than physical 
production, i.e. to take the resources of th^ firm as producing, 
according to their allocation, various incomes this year and various 
incomes next year. Then, for a given income of ix this year, the 
firm can determine the largest income £,y obtainable next year from 
its resources. Conversely, the largest present income (JLx) can be 
found if next year’s income (ft/) is given. There is thus a transfor- 
mation function relating incomes in the two years, a function very 
similar to the transformation function described above and subject, 
in the “ normal ” case, to similar conditions. The corresponding 
transformation curve can be taken as downward sloping and concave , 
to the origin ; as the income desired this year increases, the income 
obtainable next year decreases continuously and at an ever increasing 
rate.* 

6.7 Indifference curves for consumers’ goods. 

The economic problems to which we turn now are interpreted, not 
in terms of a single function or curve, but with the aid of a fimction 
type or a curve system. The first of these problems is concerned 
with the choice of an individual consumer (e.g. a family) in respect 
of the whole range of consumers’ goods available on a market. In 
order to obtain a simple representation, we take the case where only 
two goods X and Y appear in the consumer’s budget and, to this 
extent, the problem is over-simplified. The basic assumption now 
is that the consumer distributes his expenditure on the two goods 
according to a definite “ scale of preferences ”. His “ tastes ”, on 
this assumption, are such that he can arrange all possible purchases 
of the goods in ascending order of preference and, given any two 
alternative sets of purchases, he can either tell which purchases are 
preferable or say that they are indifferent to him. 

The precise expression of this assumption can be most easily given 
first in diagrammatic terms. Measuring purchases of the two goods 
along axes Ox and Oy selected in a plane, any set of purchases (so 
much of Jt and so much of Y) is represented by a point P^with co- 
ordinates* (x, y) which measure the respective amormts of X and Y 
purchased. If we start with a given set of purchases represented by 

♦ This transformation curve is called, by Professor Irving Fisher, the “ j n- 
vestment opportunity line **. See Fisher, The Theory of Interest (1930), 
pp. 264 et aeq. , 
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a point Po(^o> !/o)> then all other possible purchases can be classed 
into three groups, those preferable to, those which are less preferable 
than, and those which are on a level of indifference with the basic 
purchases (Xq, f/o)* The third group of purchases is represented by 
a set of points, Pq, Pj, Pj, , making up a certain ciuwe. This is 
called the indifference curve, corresponding to the level of prefer- 
ence associated with the original purchases (x^, y^). Now take any 
purchases (Xq', f/o') not included in the indifference group already 
defined. A second group of indifferent purchases can be defined at 
the level of preference of (Xq\ y^) and represented by a set of points, 
^o^ Pti ••• » making up another indifference curve. This process 
can be continued until all possible sets of purchases have been in- 
cluded, and the final result is a whole system of indifference curves, 
each curve consisting of points at one level of indifference. Further, 
by the basic assumption, the indifference curves themselves can be 
arranged in ascending order of preference, i.e. according to the pre- 
ferences shown by the individual for the various levels of indifference 
attached to the curves. The indifference map, consisting of the whole 
set of indifference curves, serves to describe completely the indi- 
vidual’s scale of preferences for the two goods. To compare any two 
purchases as regards preference, we need only determine whether 
the corresponding points lie on “ higher “ lower or the same 
indifference curves. 

The basic assumption tells us only that some indifference map 
can be assumed. The continuity assumption is now added for 
purposes of theoretical convenience. It is assumed, firstly, that the 
individual can vary his purchases of each good continuously and, 
secondly, that the variation from one set of purchases to indifferent 
purchases is continuous. This assumption has, of course, no justifi- 
cation except as an approximation and on grounds of expediency. 
Each indifference curve can now be taken as continuous and the 
whole set of curves appear m a continuously variable order of 
ascending preference. 

Finally, we add three assumptions limiting the form of the in- 
difference map in the “normal” case of consutner’s^ choice. The 
first is the very reasonable assumption that no indifference curve 
intersects itself or any other indifference curve. This means* that 
there is only one diiiection of yariation from a given set of purchases 
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which leaves the consumer indifferent and that no set of purchases 
can be at two levels of indiflFerence. The second assumption is that 
the level of preference increases as we move across the indiflFerence 
map in the N.E. direction in the plane Oxy, This fixes the order 
of ascending preference of indiflFerence curves and implies that the 
consumer deems himself “ better oflF ** whenever he increases his 
purchases of both goods. From this reasonable assumption, it follows 
that the level of preference can only remain unchanged if an increase 
in the purchase of one good is offset by a decrease in the purchase of 
the other, i.e. the indiflFerence curves are downward sloping. The 
third assumption is that each indiflFerence curve is convex to each axis 
of reference, implying that ever larger and larger increases in the 
purchase of one good are required to compensate (i.e. to preserve 
a given level of indiflFerence) for a steadily decreasing purchase of the 
other good. It may be objected that this last assumption is not 
reasonable as a universal, or even usual, condition. This may be so, 
in which case the indifference curves must be assumed to be of other 
forms. But the assumption can be retained to cover “ normal ** 
cases until we find that our theory based on it fails to account for 
observable facts. 

Since the consumer’s preferences for two goods are represented by 
B, system of curves, the analytical expression of the problem follows 
from the considerations of 3.7 above. The indifference map has the 
property that one and only one curve of the system passes through 
each point of the positive quadrant of the plane Oxy. The complete 
map corresponds, therefore, to a functional relation between x and y 
involving one parameter a which can be separated off as shown in 
the general expression : 

y)=a. 

Here <j>{Xf y) is a continuous expression in the amounts x and y of 
the two goods purchased by the consumer. The variables x and y, 
and the parameter a, take only positive values. If the value of a is 
fixed, we obtain the equation of one definite indiflFerence curve of the 
system. Different indifference curves arise when the valus of a is 
changed. * The parameter a is thus associated with the particular 
; level of preference which corresponds to the purchases x and y of the 
I two ^oods ; it is an ‘‘ indicator ” of the ordinal and non-measurable 
concept of preference or “ utility ^ • 
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The following are actual examples of functional relations suitable 
for the expression of a “ normal ’’ scale of preferences for two con- 
sumers’ goods : 

' tJC "f* fh 

(1) where 0<a:<a(^ - - A. 

h- sjy -\-k 

y y \ 

\ (2) (x +A)(y +i) =a, where 

\ y ^ 

*\(3) X -{-y -\-sJ2xy=a, where 0<a;<a. 

Here a is the parameter of each preference scale and h and h are 
positive constants. Figs. 24, 26 and 26 show indifference maps 
drawn from relations of the above types and illustrates the fact 
that the preference scales are of “ normal ” form. 

It must be emphasised that the whole indifference map, given by 
<f){Xy y)~a, is needed to describe the preferences of the consumer. 
The parameter a is an essential part of the description. To allow for 
changes in the consumer’s tastes we must modify the form of the 
whole indifference map. It is the function ^{x, y) which reflects the 
particular tastes of the consumer ; the form of (f> is fixed for given 
tastes and is modified whenever tastes change in any way. These 
remarks can be illustrated by the relations (1) and (2) quoted above. 
The constants h and h have fixed values for a given indifference map 
but, by changing their values, we modify the whole map and allow 
for changes in the complex of preferences of the consumer. Hence, 
in representing a scale of preferences by a relation such as (1) or (2), 
the variation of the parameter a gives the different curves of the 
indifference map while the constants h and A;, given in value for a 
given complex of tastes, are changed only when we wish to consider 
the results of varying tastes.* 

5.8 Indifference curves for the flow of income over time. 

A system of indifference curves, very similar in nature to that for 

♦ It is important to be clear on the use of parameters here. The parameters 
of a demand or cost curve relate to external factors (e.g. the number of con- 
sumers or the prices of the factors of production). In a given situation, there 
is only a«ingle demand or cost curve. As the situation changes, the parameters 
take different values and the curve shifts in position. P#at a giver# situation as 
regards consumers’ preferences is described by a whole system of curves and 
a parameter is needed to take us from one curve to another. If the situation 
itself changes, we must add other parameters in order to change the* whole 
system of curves. The varying tastes of a consumer are described by the 
shifting of a system oficurves, i.e^by a system of a system of curves. 
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consumers’ goods, can be defined to describe a rather wider economic 
problem. An individual consumer can be assumed to act according 
to a scale of preferences, not only for goods purchased in one period 
of time, but also for goods bought in successive periods. But the 
discontinuous “ flow ” of purchases over time can be analysed on 
exactly similar lines to purchases at one time. In order to obtain 
a convenient representation of the wider problem, severe simplifying 
assumptions must be made. We assume that the consumer takes 
into account purchases of goods made in only two successive years 
and that his income can be taken as an indicator of his preferences 
for the actual collection of purchases made in either year. We can 
then speak of the consumer’s preferences for various groupings of 
this year’s and next year’s incomes, so much to spend now and so 
much to spend next year. 

On the basic assumption of a scale of preferences for this and next 
year’s income, the consumer’s time preferences for income can be 
represented by a system of indifference curves in a plane referred to 
axes along which we measure the amounts of this year’s income (x) 
and of next year’s income (y). One income grouping (so much this 
year and so much next year) can be compared with another grouping 
by the position of the corresponding points on the indifierence map. 
The groupings are indifferent to the consumer if the points he on 
one indifierence curve ; one grouping is preferred to the other if it 
hes on a ‘‘ higher ” indifierence curve. The income indifierence map 
corresponds to a function type of the form 

i}j{x, y)=a, 

where }fs{x, y) is a given relation between x and y and where a is a 
positive parameter indicating the level of preference for the incomes 
X and y. The indifference map can be assumed to be continuous, 
the curves being non-intersecting, downward sloping and convex to 
the axes. Preference for income combinations, as indicated by a, 
must increase as we move across the indifference map in the N.E. 
direction. These hmitations on the form of the indifference map are 
intended to apply to the “ normal ” case of a consumer’s preferences 
for income over tim?e and they are similar to the hmitations imposed 
in the previous problem of purchases of goods made at one time. 

Further properties of the indifference map for incomes can also be 
assumed in “ normal ” cases. If neither income js large, the indiffipr- 



FUNCTIONS AND DIAGRAMS IN ECONOMIC THEORY 129 


ence curves must be steeply inclined to Ox (the axis of present 
income) since some of this year’s income will only be sacrificed in 
return for relatively large additions to next year’s income. As the 
combined income of the two years increases, the curves tend to 
become less steep and, when both incomes are large, they approxi- 
mate closely to straight lines inclined (negatively) at an angle of 45®. 
The addition to next year’s income will then be little larger than the 
loss to this year’s income for which it compensates. It is seen that 
the curve system of Fig. 24 is suitable for the representation of such 
an income indifference map. The analytical expression is 

- - (0<x<o(A- 

A - vy +A; 

where a is the parameter indicating the level of preference for the 
incomes x and y in the two years. Fixed positive values must be 
given to A and k if the consumer’s preferences for income are known. 
These values are only changed when the whole scale of mcome pre- 
ferences is modified, 

EXAMPLES V 


Economic functions and curves 


1. The price of a certain brand of tea is p ponce per pound and it is found 
that the market demand is x thousand poimds per week where 


p 

20 

24 

28 

30 

32 

36 

40 

X 

82-5 

70-8 

631 

60 7 

650 

48'9 

39*8 


Represent these demands graphically and show that the demand law is 
approximately of the linear form a; = 120 - 2p, Then graph the total revenue 
curve and find the largest revenue obtainable. 

2. A cheap gramophone sells at £p amd it is known that the demand, 
X hundred machines per year, is given by 


^ 

X— 6 . 

p+6 


Plot a graph of the demand curve. At what price does the demand tend to 
vanish? Draw a graph of the total revenue curve and determine at what 
price and output the total revenue is greatest. 


3. A firm selling toothpaste investigates the form of the demand for its 
product by fixing different prices in four successive periods. It is found that 
the necessary outputs are » 


Price (d. per tube) 
Output (tubes per week) 
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Choosing suitable scales, plot the total revenue for these outputs graphically. 
Show that total revenue can be taken approximately as a linear function of 
output. Deduce the demand law. 

4 . What type of demand curve corresponds to the demand law 

a 

p=-.c. 

where a and e are positive constants? Show that there is some demand no 
matter how large the price. Is there any limit to the extent of the demand 
for small prices ? Show that total revenue falls steadily as output rises and 
compare with the case of the previous example. 


5. Examine the demand curve p=- 


where a and b are positive 

constants. Show that the demand increases from zero to indefinitely large 
amounts as the price falls. What type of curve is the total revenue curve? 
Show that revenue increases continuously to a limiting value and contrast this 
case with that of the previous example. 


6. From a consideration of the total revenue curves obtained in the 
previous two examples, why can wo say that these demand laws are not 
typical of ordinary demands, except as approximations for limited ranges of 
prices ? 

CL 

7. Of what twe is the demand curve : +c, where o, h and c are 

d X ■\'b 

positive constants? If ^ is small, show that a relatively small fall in price 

increases the demand from zero to a large amount. Is tliis a suitable approxi- 
mate demand law for (e.g.) Player’s cigarettes? 

8. A speculative builder of working-class tenements rents each tenement 
at p shillings per week. It is known that 

p =:\^225 -9x, 

where x is the number of tenements let per week. Graph this demand law. 
Is it the kind of law you would expect here ? 

9. Find a; as a function of p from the demand law p = s/a -bx. Show that 
the demand curve is an arc of a parabola with its axis parallel to Ox, Locate 
the vertex and indicate the shape of the curve. 


10. The number (x) of persons per day using a motor coach service to 
Southend is related to the fare ( p shillings) charged by the law 



Show that the demand curve is a parabola and locate its vertex. Also graph 
the total revenue curve, showing that revenue rises rapidly to a maximum 
before falling off slowly. Generalise by considering the demand law 
p = (a - bxy, where a and b are positive constants. 


11. If X is the number (in hundreds) of business men travelling by the 
8.15 a.m. tfain from Southend to London when the return fare is shillings, 
the dememd la^w is 


Draw the demand curve, show that it is a parabola and locate the vertex. 
Also draw the total revenue curve and find at what fetr^ the revenue is fi 7 ea};est. 
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Contrast the foiTns and economic interpretations of the demand curves in this 
and the previous example. 

12. At a charity performance at the local cinema, it is known that the 

d 

attendance at a imiform charge of p pence will be x =- - 6, whore a and b are 

constants. It is foimd that the cinema, which has 3000 seats, is half-filled 
when Is. is charged but that only one-sixth of the seats are empty at 9d. 
Find the constants a and b. Deduce the charge that fills the cinema and show, 
if all the proceeds go to the charity, that the latter benefits most in this case. 

13. In the case of “ constant *’ costs /7 =cm 5 + 6, show that the average cost 
curve is a rectangular hyperbola, average cost falling continuously towards 
the value a as output increases. A builder of small bungalows has fixed 
“ overheads ** of £6000 per year and other costs are always £300 per bungalow. 
Graph the curve showing average cost per bungalow when a variable number 
X is built per year. 

14. A fixed plant is used to manufacture radio sets and, if x sets are turned 
out per week, the total variable cost is £(3x-fi^x*). Show that average 
variable cost increases steadily with output. 

15. If the overheads of the plant of the previous example are £100 per week, 
find the average cost in terms of the output and draw the average cost curve. 
What is the least value of average cost? 

16. A coal retailer buys coal at a list price of 40 shillings per ton. He is 
allowed a discount per ton directly proportional to the monthly purchase, the 
discmmt being 1 shilling per ton when ho takes 100 tons per month. His over- 
heads are £50 per month. Obtain total and average cost as functions of his 
monthly purchase (x tons) and draw the corresponding curves for purchases 
up to 1000 tons per month. 

17. The electricity works in a small town produces x thousand units per day 
at a total cost of £(2N/40a; - 176 + 90) per day. Draw the total cost curve. 
Express the average cost, in pence per unit, as a fimction of a;, draw the 
average cost curve and show that, as output increases above a certain 
minimum, average cost decreases rapidly. 

18. The total variable cost of a monthly output of x tons by a firm producing 
a valuable metal is £(t'^* -3a;* + 60a;) and the fixed cost is £300 per month. 
Draw the average cost curve when cost includes (a) variable costs only, and 
(6) all costs. Find the output for minimum average cost in each case. Show 
that the output giving least average variable cost is loss than that giving least 
average (fixed and variable) cost. 

19. A firm produces an output of x tons of a certain product at a total 
variable cost given by 

n=cuc* - bx* -I- cx. 

Show that the average cost curve is a parabola, find the output for least 
average c®st and the corresponding value of average cost. 

20. A tobacco manufacturer produces x tons per day at a total cost of 

a;(x + 200) 

^4(x + 100)’ 

Graph the total and average cost curves and show that average cost decreewes 
continuously from lOs. f)er lb. towards a lower limit of 6s. per lb. 
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21« Generalise the previous example by taking the total cost as 

x + b 


n=ax~ 


a5 -f-c 


where a, b and c are positive constants (6 >c). Show that total cost increases, 
and average cost decreases, as output increases. Of what type is the average 
cost ciirve? 


22. If the tobacco memufacturer produces at a total cost of 

l400(a:+400) / 

when his output is x tons per day, draw the graphs of the total and average 
cost curves. Describe the way in which total cost increases with output and 
find the output at which average cost is least. 

Generalise to show that the total cost function 

+d 

x + c 

gives total and average cost curves of “ normal *’ form. 


23. On the market of a country town, butter is brought for sale from the 
surrounding district weekly. It is foimd that the weekly supply {x lbs.) 
depends on the price (pd. per lb.) according to 

X = lOOVp - 12 + 160. 

From a graph of the supply curve, show that there is no supply at any price 
less than Is. per lb. and that the supply increases continuously as the price 
increases above Is. per lb. Generalise by considering x=^a^p -6 + c, where 
a, b and c are positive, as a typical supply law. 

24, English cloth is exchanged for German linen. The amount of cloth 
(y million yards per year) offered by England in exchange for a given amount 
{x million yards per year) of linen is given by 

x(65-x)* 

^ 2400 

The amount of linen offered by Germany for a given amount of cloth is 

^_ y(25-y) 

10 

Represent these conditions diagrammatically and show that, for equilibrium, 
10 million yards of cloth exchange against 16 million yards of linen. 


26. With a given plant and given supplies of labour, raw material, etc., a 
chocolate firm can produce two “ lines ” in various proportions. If x thousand 
lbs. of one line are produced per year, then 


thousand lbs. per year is the output of the other line. Represent the con- 
ditions graphically. What kind of curve is this transformation oujwre? 


26, A man ^derives £x from his business this year and £y next year. By 
alternative uses of his resources, he ccm vary x and y according to 


y = 1000~ 


260 


Graph the transformation curve. If he wants £760 n^xt year, how muclf can 
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he take out of the business this year? Generalise by showing that y =6 

represents a “ normal ” case in this type of problem if the constants a and b 
are such that b>a. What is the transformation curve? 

27. If a coal mine works x men per shift, the output per shift is 



tons of coal. Draw a graph to show the way in which output varies with the 
number of men and find the size of the shift for maximum output. Express 
the average product per man as a function of x, draw the corresponding curve 
and show that it is a parabola. When is average product greatest? 

28. A consumer’s indifference map for two goods X and Y is defined by 

4 - 4- 1 

— z=a, 

a? + 4 

Draw a graph showing the five indifference curves for the values 2, 3, 4, 5, 6 
of the parameter a. Verify that they are of “ normal ” form. 

29. The relation (a; 4- l)(y +2) =2a defines a system of curves. Draw a 
graph showing the curves, in the positive quadrant, for a = 2, 3, 4, 6, 6. Are 
these curves suitable indifference curves for the case where an individual can 
derive incomes of £x and £y in two years? 

30. Consider the form of the indifference map given by the relation 

{x +h)^y =a, 

where h and k are fixed positive constants. Draw graphically a selected 
number of the curves in the case = 2, A; = 1 , and in the case = 1 , Aj = 1 . -Show 
that one case is suitable for the indifference map for two consumers’ goods 
and the other for incomes in two years. 



CHAPTER VI 


DERIVATIVES AND THEIR INTERPRETATION 
6.1 Introduction. 

Having made clear the nature of some of the fundamental ideas of 
the mathematical technique, we find ourselves in a position to attack 
one of the central problems of mathematics, the problem with which 
we are concerned in the “ differential calculus If j/ is given as a 
fimction of x, then the value of j/ is determined by, and changes with, 
the value allotted to x. It is clearly important to devise a method 
of comparing the changes in y with those in x from which they are 
derived, of measuring the rate at which y changes when x changes. 
In the differential calculus, we make precise what we mean by the 
rate of increase or decrease of a function, we set out a method of 
evaluating these rates for various functions and we systematise the 
problem by introducing convenient symbols, processes and rules. 
In the words of Professor Whitehead, ^e differratial calc^us 
is the^ ‘Iflystenjatic consideration of the rates of increase of _ 
f unctio nal*. 

In the present chapter, it is proposed first to treat the problem of 
defining and measuring rates of change from the purely analytical 
point of view. It is then found that a very important diagrammatic 
interpretation of the problem can be given. Finally, since the im- 
portance of any mathematical method depends on its use in inter- 
preting scientific phenomena, we shall consider quite generally the 
applicability of our new concepts in a wide range of scientific studies. 
The development of the practical technique of dealing with our 
problem will be given in the following chapter. « 

We can bejgin bj^ considering special fimctions of simple form. If, 
in the function we increase the value of x from one definite 

valrfe to a larger value x^t the value of the function increases from 
jTi* to The variable x is increased by amqunt {x^ -x^ and«the 
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corresponding increase in the function is (Xj® - Xj®). The average raie 
of increase of the function per unit increase in x is then 


Xj® - a^i® . , 

— =Xi* +X1X2 +x,*, 

*^2 *^1 t ' ; 

an expression which depends on both and Xg. The average rate of 
increase is dependent on the value of x from which we start and on 
the amount of increase allotted to x. 

Consider, now, only those increases which start from a definite 
value of X. Adopting a different notation, let x be the fixed original 
value of X and h the amoimt of the increase allotted. As a; is 
increased from x to (a; +A), the function increases by an amount 
{(x +A)® - a;®} and the average rate of increase is 


(a; - a:* 

h 


= 3a;2 4-3a:A 4- A*, 


which is taken as dependent on the variable increase A in a:. 

As the value of A becomes smaller, the average rate of increase 
is seen to approach the limiting value 3a;^ which we term the 
(instantaneous) rate of increase of the function at the point x. 
Hence, by definition, the rate of increase of the fimction at 
the point x is 


Lim 

A-K) 


{x -\-hY -x^ 
A 


= 3a;2. 




The average rate of increase of^tbe function is a perfectly definite 
concept for any actual increase A in a; however small. On the other 
hand, the instantaneous rate of increase is a limiting and abstract 
concept, the limiting value approached by the average rate of 
increase as smaller and smaller increases in x are allotted. It will be 
foimd that, although scientific phenomena are expressed directly 
i by means of average rates of increase, mathematical analysis finds 
it more convenient to treat the related abstract concept of the 
instantaneous rate of increase. The mathematically significant fact 
is that a*function is tending to increase at a certain rate at a definite 
point, not that it actually increases at a certain average rate over a 
definite range of values of the variable. 

The instantaneous rate of increase has a value for any value of x 
we eare to select. The rate of increase of x^ is Sx^ when we measure 
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it at the point x. So, the rate of increase of at the point a: = 1 is 3, 
at the point a; = 2 it is 12, and so on. In the first ca^e, the value 
of x^ is tending to increase at the rate of 3 units per unit of x and 
similarly for the other cases. The rate of increase of x^ at the point 
X changes as x changes, being always given by the expression 3a;^ v 

The rate of increase of a function of a; is itself a function of a;. 

'*» 

As a second example, it is seen that the function t/=- decreases 

/I 1 \ ® 

by an amount I =■ j as x increases by an amount h from x to 

(x -f A). The average rate of decrease of the function is thus 


-f--- 

h\x X +hj x(x-\^ 


:(x 4- A) 

The instantaneous rate of decrease of the function at the point x is 


Lim U- 

The function - decreases at various rates as x increases from different 

^ 1 
values, the rate being always given by the expression . 

The process described above can be extended and generalised to 
apply to any single-valued function whatever. Suppose the inde- 
pendent variable of the function y=f{x) changes in value by an 
amount A from x to (x +A). We can regard A as taking positive or 
negative values, a positive value indicating that x has increased and 
a negative^value that x has decreased. The corresponding change 
in the value of the function is then of amount {f{x 4- A) -f(x)} and 
this, again, can be positive or negative according as the value of the 
function has increased or decreased. The expression 


f{x +h) -f{x) 
k 


then indicates the average rate of change in /(a;) per unit change in x. 
The sign of this expression is important. If the sign is positive, then 
/ (x) changes in the same sense as x itself, mcreasing as x increases and 
decreasing as x decreases. If the sign is negative, then/(x) changes 
in the opposite sense to x, increasing when x decreases and decreasing 
when X increases. The important question is whether the average 
rate of change tends to approach any <jefinite limiting value as A gets 
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Bmaller through positive and negative values. I f a limit exists, the n 
it represents the instantaneous rate of change of f(x) at the point x, 
i.e. the rate at which /(a;) is tending to increase or decreasi^accpr^ 
to the sipi of thi^^]^ x changes from tho point x. If no limi t 
exists, ^hen the value of the function changes' in an erratic way as 
smaller and smaller changes are made in the variable x, changes 
which cannot be described by any definite instantaneous rate of 
change. The whole problem of the existence and value of the rate 
of change of a function y=f{x) thus turns on the existence and value 

i)f the limit of the ‘‘ incrementary ratio ^{/(^ ^ tends 

to zero at a definite point x. 

^ In the first example considered above, the rate of change of a;* 
at the point x is given by and, since this is positive, we know that 
increases as x increases. On the other hand, the rate of change of 


the second function y = - at the point x is 


Lim 

A-M) 


I = Lim ( - = 

h\x+h xj x(x+h)} 


• \V> 

Except for the sign, this is identical with the rate of decrease we 
obtained previously. In fact, the negative sign of the rate indicates 
that it is a rate of decrease as x increases and the numerical value, 


i.e. — , then measures the actual rate of decrease at any point. A 

X 

rate of change, having sign as well as magnitude, is a wide concept 
including both rates of increase and rates of decrease. 


6.2 The definition of a derivative. 

If y =f{x) is a single-valued function of a continuous variable, and 

if the incrementary ratio ^{/(^ ~/(^)} tends to a definite limit 

as the value of h tends to zero through positive or negative values, 
then we say that the function has a derivative at the point-x-given 
in value by the limit of the incrementary ratio. If the incrementary 
iratio has no limiting value, then the function has no ^derivative at 
the point x. 

A number of different notations for the derivative of a function 
are in current i^sel The two following notations are the most common 



138 


MATHEMATICAL ANALYSIS FOR ECONOMISTS 


and they ’ndll be used throughout the present development. Tlje 
derivative of y=f{x) at the point x is written either as 

!=>)• 1 

or 843 

Since the function itself can be written as j/ or as/ (x), each of these 
notations appears in two equivalent forms. Hence : 

Definition : The derivative of the function y —/{x) with respect 
to the variable x at the point x is 


if the limit exists. 


It is convenient, at this, stage, to set out a number of observations 
on/the definition of the derivative : 

\/(l) The concept of a derivative, from its very nature, apphes 
only to a function of a continuous variable. Further, the defini- 
tion given here applies only to frmctions which are single-valued 
and the extension of the concept to cover midti-valued functions is 
postponed. 

/(2) The incrementary ratio used in the definition is not defined 
when h is actually zero ; in this case it assumes the meaningless form 
of zero divided by zero. Thus the derivative cannot be regarded as 
the ratio formed by putting A=0 in the incrementary ratio. The 
derivative, in fact, is the result of an operation p erformed on the 
fimction, the operation of obtaining the limit of the incrementary 

ratio. In the notation or ;^/(^)> must be regarded as an 

operational symbol applying to the function and its use is very 
similar to that of the symbol / in the functional notation f{x). In 

dti 

particular, it must be noted that the symbol ^ does not imply that 

derivative is a ratio of one thing {dy) to another (dx), • v 

J (3) The *incremedtary ratio can be expressed in an altemati \^3 
form which is sometimes useful. Corresponding changes or “ incre- 
ments in the variables x and y connected by the function can be 
denoted^by Ax and Ay respectively. T]ie increnaientary ratio is then 
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^ V 

and the derivative appears as the limit of this ratio as Ax tends 

to zero. Both Ax and Ay take finite values and, if a value is allotted 
to Axy the corresponding value of Ay is found from the function 
iteelf. 

(4) The value of the derivative must be expressed as so many units 
of y per unit of x. If we change the units in which either variable 
is measured, then we must also change the numerical value of the 
derivative at any point. The rule for change of units, considering 
the derivative as a quantity derived from x and y=f[x)y is given in 
1.7 above. If an old unit of x equals A new imits and an old unit of y 


equals /x new imits, then the new value of the derivative is ^ times 
^ A 


the old value. The fact that the derivative value depends, according 
to the familiar proportionality rule, on the units of measurement of 
both X and y must always be remembered. 

(5) It is important to be clear about the use of the value x 
that appears in the definition of the derivative. The derivative is 
obtained by finding the limit of the incrementary ratio for a constant 
value of X. The limit, once found, has a value dependent on the 
particular value of x selected, and different limiting values result 
when different constant values are given to x. Here we have 
another example of a parameter : a; is a constant in finding the 
limit (when h is the only variable) but parametrically variable from 
one limit to another. The value of the derivative thus depends on 
the value fixed for x, i.e. the deriva tive is itself adjunction of x. There / 
are, in fact, two points of view ; the derivative may be obtained as; 
a definite value for a given x (the derivative at a given point) or it 
may be taken as a function of x (the derivative at a variable point). 
From the latter point of view, the notation f'{x) is particularly 
appropriate and an alternative term “ derived function ” can be 
used. From the original function f{x), we derive a second function 
f'{x) by the process of finding derivatives of f{x) for all the various 
possible Talues of x, 

^fo) It must not be assumed that the derivative 6f f{x\ necessarily 
exists at any point. In fact, the derivative cannot be used unless 
we have determined that the incrementary ratio has a limit at the 
point in question. A function piay have a derivative at some points 
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and not at others ; some functions may have a derivative at a point 
and others not. It wiU be seen (6.6 below) that the non-existence 
of a derivative is exceptional and of very special significance. 

(7) The derivative notations we have given apply to any function 
y=^f{x). The notations must be suitably modified when a particular 

form is given for a function. So, ^ (a:®) and ^ ] denote the 

cLx (Jix \x/ 

derivatives of the particular functions x^ and - respectively. The 

x 

notation also needs modification when we wish to write the value of 
the derivative, not at a general point x, but at some definite point. 
The following examples illustrate the flexibility of the notation. 

Il/(x)]^_^=/'(0) and 

denote the derivatives of f(x) at the particular points a; = 0 and x=i 
respectively. More generally, the derivative of f(x) at a; = a is 

For a particular function, say y=x^, the notations 

' HH,., “<> 

indicate the derivative value at a:=0, a: = J and a;=a respectively. 

(8) Finally, it is scarcely necessary to note how convenient it is to 
have a special notation and terminology for the derivative concept, 
liimits of the kind indicated by the derivative are of frequent appear- 
ance in mathematical theory and applications. The short and 
flexible derivative notation, in replacing the clumsy limit notation, 
makes it possible to deal with these limits with great ease. In 
particular, the process of systematising the operation of finding the 
limit, to be discussed in the next chapter, is facilitated. Without 
this labour-saving device, scarcely any progress is possible. 


6.3 Examples of the evaluation of derivatives. 
The considerations of 6.1 show that 


• and 

Thd following examples are further illustrations of the way in which 
derivatives of particular functions are obtained from the definition : 
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Ex. 1. The derivative of (2x-l) at any point x is obtained as the 
limit as h tends to zero of the ratio 

{2{a:+A)-l}-(2a;-l) 2x + 2h-l-2x + \ 2h „ 

A ~ h h~‘^' 


So 




The derivative, in this case, is a constant. A similar result holds in the 
case of the derivative of any linear function : 

d 


dx 


(aa;+6) = a. 


JEx. 2. The incrementary ratio for the function {x^ +3x - 2) is 
J[{a: + A)* + 3 (a: + A) - 2} - (x* + 3x - 2)] 

= + 2Ax + A*) + (3x + 3A) -.2 -jx* +3* -2)] 
2hx 

h 

■= 2x + 3 + A 
->2x-h3 as A“>0. 

So ^{x^ -\-3x - 2) :=2x + 3. 

In the same way, the derivative of the general quadratic function is 


^(ax^ +6x+c) =2ax +6. 


j' 


2x + 1 

Ex. 3. For the function incrementary ratio is 

X 1 ^ i 

1 f 2(x + A)+l 2x + n _ (x-l)( 2 x + 2A + I)-(2x + l)(x+A-l| 
A\(x+A)-1 x-1 J A(x - 1)(x - 1 +A) ‘ 

-3A 

A(x - l)(x - 1 +A) 

-3 

“(x-l)(x-l+A) 

3 


So 


(x-1)* 

d / 2x + l \ ^ 

dx\x-l/ (x-j 


as A-)-0. 


(x-1)* 

The result, as given, holds for any value of x not equal to I. If x > I. the 
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function itself has an infinite discontinuity and the question of the 
existence of the derivative at this point needs further consideration (see 
6.5 below). 

6.4 Derivatives and approximate values. 

The analytical interpretation of the derivative as the measure of 
the rate of change of the function is clear from 6.1 above. Further, 
since the rate of change is the limiting value of the average rate of 
change over a small interval, the derivative is also an approximate 
value of the average rate of change, the smaller the interval of the 
average rate the closer being the approximation. Hence : 

If f{x) has a derivative f'(x) at the value x, then f'{x) measures 
the rate of change of f(x) at the point x, and is an approximate 
measure of the average rate of change of f(x) for any small interval 
from the point x. 

As already noted, the rate of change can be a rate of increase or a 
rate of decrease according to the sign of the derivative at the point. 
In either case, the derivative as the rate of change must be expressed 
as so many imits of the function per unit of x. Two units of measure- 
ment are involved in the derivative and, if either unit is changed, so 
is the expression of the derivative. 

The use of the derivative as the approximate value of the incre- 
mentary ratio or average rate of change is an important one. If 
Jx is any small increment in the value of x, and if zJy is the corre- 
sponding small increment in the value of y=f{x), then the deriva- 
tive f'{x) is the limiting and approximate value of the quotient of 
Ay divided by Ax, Hence 

Ay=:f'{x)Ax approximately when Ax is small. 

In other words, if we multiply a small increment in a; by the rate of 
change f'{x), we obtain an approximate value of the corresponding 
small increment in the function y=f{x). 

Again, with a slightly dififerent notation, the value of the derivative 
f'(a) at the point x=a is an approximate value of the expression 

r{/(a -f A) -/(a)} for a small value of h, i.e. 
n 

f{a +A) =f{a) 4-A/'(a) approximately when h is small. 

This result enables us to trace the effect of smjU “ errors or dgvia- 
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tions in the variable x. The value of x is erroneously taken as a 
instead of the correct value (a +A), where A is a small “ error 
The above result shows that the “ error ” in the value of the function, 
resulting from the insertion of a instead of (a +A), can be taken 
approximately as hf(a). For example, the volume of a sphere of 
radius x inches is ^ttx^ cubic inches. If the radius of a given sphere is 
estimated as a inches, then the estimate of the volume is cubic 
inches. The error involved in the volume estimate is approximately 
4:7Ta% cubic inches, where h is the small error in the original estimate 


of the radius. 



It is often convenient, for a definite small range of values of Xy to 
replace the value of a given function f{x) by an approximate linear 
expression in a;.* The respR-Jwe have given enables us to do this. If 
the approximation is to ^erve for values of x near a: =a, we substitute 
A = a; - a in our result aikd write ’ 

7 (*) =/(«)+(* 

approximately for smajl values of (x-a). For example, if 

‘ \ o 



J*/ V A 

/ W =^_1 » 

then 

/(3) = 2 a 

So 

2a: +1 7 

— ^ 1 
, — 2 "1 


a;- 1 

i.e. 

2x +1 

a; - 1 


and 


(x-l) 


2 > 


2 3 
4 




approximately when {x-3) is small. In general, therefore, it is 
possible to represent any given function by an approximate linear 
expression, f{x)=oc +^x, for a small range of values of x. The 
constants a and ^ are determined in the way described. 


6.5 Derivatives and tangents to curves. 

One of the important problems of geometry is that of finding the 
tangent^at any point on a given^ctw^ve. The tangent describes the 
direction of the curve at the point and, to define it,^we must*introduce 
the idea of a limiting process. The hne joining any two points P and 

♦ See, for example, Wicksell, Uher Wert, Kapital und Rente (1893, reprinted 
1933), pp. 31-2. 
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Q on a curve is called a chord of the curve and its position varies as 
either P or ^ is varied. Suppose that P is fixed and that Q moves 
along the curve and approaches P from one side or the other. It is 
clear that it is usually, but not always, the case that the chord PQ 
tends to take up a definite limiting position PT as Q approaches P. 
Fig. 40 illustrates a case where this happens. It is possible, however. 



that no such limiting position PT exists at certain points, e.g. at 
points where the curve is discontinuous. If the limiting position 
PT of the chord exists, then it is called the tangent to the curve at 
|P. If no limiting position exists, then the curve has no tangent 
at P. 

The instantaneous direction of the curve at any point, i.e. the 
direction in which the curve is tending to rise or fall, is indicated by 
the tangent to the curve at the point. A convenient measure of the 
direction of the curve is thus provided by the gradient of the tangent. 
This gradient varies as the tangent varies from point to point on the 
curve and is always given as the limiting value of the gradient of any 
chord PQ as Q approaches P. 

Our discussion is now limited to the case of a curve repre^nted by 
a single- valued fufiction y=f(x), there being only one point on the 
curve above or below each point on the horizontal axis Ox. Let P 
and*© be two points on the curve with abscissae x and (x +h) respec- 
tively, X being fixed and h any positive or negative amount. The 
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gradient of the chord PQ is the ratio of the diflFereiice between the 
c^dinates of Q and P to the difference between the abscissae of these 
points (see 3.3 above). So 

Gradient of PQ = . 

h 

The gradient of the tangent at P is the limiting value (if it exists) of 
this expression as Q approaches P, i.e. as Hence 

Gradient of tangent at P=Lim — =f'(^x). 

The problem of finding the tangent and its gradient at any point on 
a curve is thus equivalent to the problem of finding the derivative 
of the function y=f(x) which represents the curve. So 

The tangent to the curve y=f{x) at the point with abscissa z 
exists if the function has a derivative at the point x and the tangent 
gradient =/'(x). 

From 3.3 above, a straight line through a fixed point P (Xj, y^) on 
the curve y=f{x) and with gradient m has equation 

y-yi=m(x-x^). 

This line is the tangent at P if the gradient m is equal to/'(Xi). So 


The equation of the tangent to the curve y =/(x) at the point with 
co-ordinates (x^, y^) is y -yi=f'{xi){x -Zi). 


Since the point P lies on the curve, we have y^ =f{xy). The tangent 
equation thus involves only one parameter, i.e. the abscissa Xj 
of the selected point on the curve. 

Taking the parabola y=x* as an example, we can show that 


^ (x*) = 2x, i.e. the tangent gradient of the parabola is 2x at the point 
dx 


with abscissa x. 


The tangent equation at (x^, y^ is 
y-yi = 2xi (x - Xi) where y^ = x^*, 


i.e. y-2xiX-Xi2 +2xi* = 0, 

i.e. * y-2xiX 4-Xi*=:0. , 

For example, the point (1, 1) lies on the parabola y =x*, the tangent 
at the point has gradient 2 and the equation of the tangent is 
y - 2x -f 1 =0. The tangent gradient and equation at any point on 

•f MJL 
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a given rectangular hyperbola can be found in an exactly similar 
way. It is only necessary to evaluate the derivative of the function 
which represents the curve. 

The diagrammatic interpretation of the derivative as a tangent 
gradient is a very important one and much use of it wiU be made 
later when we come to the applications of the derivative in such 
problems as that of maxima and minima. In the meantime, it can 
be used to supplement points already discussed. 

It has been seen that the process of derivation enables us to 
derive, from a given function y=/(a:), a second function y=f\x). 
The curves representing these two functions can be plotted with 
reference to the same axes and the connection between them is 
interesting. The ordinate of the derived curve y=f'(x) at any point 
is equal to the tangent gradient at the corresponding point on the 
origmal curve y^f{x). The original curve shows the way in which 




the value of the fimction/(a;) changes ; the derived curve shows the 
variation of the rate of change of the fimction. For example, the 
curve y = \x^ and the corresponding derived curve y^\x are shown 
in Fig. 41. The first curve shows that x* increases at an increasing 
rate as x increases. The second curve, which is a straight line, shows 
that the rate of increase of actually increases at a steady rate. 

The nature of the approximation involved in writing /'(a:) Ja; for 
the increment in the function /(x) is clearly seen in Fig. 42. li P is 
the point (it, y) on ttie curve y=f{x) and Q the point with abscissa 
a: -f- Ja;, then *Ax—LM—PN. The increment in the function ia 

• NT 

Ay=MQ - LP=NQ. Now, /'(a;) = gradient of the tangent PT = ^ , 

• ^ PN 
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i.e. f{x)Ax=NT. The approximation of f'{x)Ax to is thus 
represented by the approximation of NT to NQ, The difference 
between these lengths clearly decreases as Ax decreases and Q 
approaches P. 

It is obvious that there is some relation between the continuity of 
a function and curve and the existence of a derivative and tangent. 
It can be shown, in fact, that the existence of a derivative and 
tangent is a more severe restriction on the function and curve than 
the condition of continuity. Of the two continuous curves of Fig. 33, 
the first has a tangent at all points while the second ha;S one point 
(a sharp point) where no tangent exists. The fact that a curve is 
continuous does not imply that a tangent exists at all points. A 
continuous curve can be 
drawn without taking pencil 
from paper ; a curve which 
has tangents at all points is 
not only continuous but also 
“ smooth ” in the ordinary 
sense of this term. All 
the ordinary functions of 
mathematical analysis are 
confir ’‘^us and have deriva- 
tives (except perhaps at 
very special points) and ’ Fio. 43. 

they are represented by 

continuous and smooth curves. Further, when we introduce a 
function such as a demand or cost function, it is usual to assume, not 
only that it is continuous, but also that it has a derivative, ex eluding 
the case where the corresponding curve has a sharp point. 

Finally, nothing has been said so far about the possibility that the 
incrementary ratio defining the derivative has an infinite hmit. If 

+ A) as h-^0 at a point where /(x) exists, we say 

that the de^vative/'(a;) is “ infinite ” at the point. In diagrammatic 
terms, there is a definite tangent to the curve y =/(a?) at the f)omt P 
with abscissa x and the gradient of the tangent is infinite, i.e. the 
tangent is parallel to Oy, The case is illustrated by the curte 
t/ = 1 1, shown m^Fig. 43, which has a vertical tangent at the 


yi 
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point where a? = 1 . Infinite derivatives can be obtained even in cases 
of ordinary continuous and smooth curves. Further, i£f{x) is itself 
infinite at the point in question, we can still say, in many cases, that 
the derivative exists and is infinite at the point. The rectangular 

2a; -fl 

hyperbola of Fig. 33 is a case in point. Both the function — and 

3 

its derivative -7 rri are infinite at x = l. The curve “goes off 

(a;-!)* 

to infinity ** at this point and has a vertical tangent “ at infinity ”, 
i.e. the asymptote x = l. To deal with many infinite discontinuities, 
therefore, we have only to allow for the possibility of both the 
function and its derivative being infinite. 


6.6 Second and higher order derivatives. 

The derivative of a function of a variable x is also a function of x. 
It is possible, therefore, to apply the process of derivation all over 
again to the derivative and to obtain what is called the second de- 
rivative of the original function. The second derivative, if it exists, 
can be obtained for various values of x and is again a function of x. 
We can thus proceed to find the derivative of the second derivative, 
i.e. the third derivative of the original function. In the same way, 
derivatives of the fourth, fifth and higher orders can be obtained in 
succession. Extending the notations already adopted for the de- 
rivative, the sequence of successive derivatives of a given function 
yz=f[x) can be written 


dx’ 

d*y 

dz»’ 

d^y 

dx^ 




f, ^ 

Of y, y', 

y”. 

y”', ... ; 

f{^). 


/"W, 

/'». ... 


The sequence of derivatives can be extended indefinitely, with the 
sole limitation that the derivatives at and beyond a certain order 
may fail to exist on account of the breakdown of the limiting process 
which defines them. For all ordinary mathematical functions, how- 
ever, the derivatives of all orders exist.* But it is foimd that the 
first and second derivatives are sufficient for the description of the 
main properties’^ of functions, and, in the applications of mathe- 


♦ It often happens, of course, that the derivatives beyond a certain stage 
become zero, aa in the case of the function y =xK This is not the same thing 
as saying the derivatives do not exist. 
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matical analysis, derivatives of higher order than the second are 
seldom needed. 

The analytical interpretation of the second derivative is easily 
provided. The value of f'(x) measures the rate of change of f(x) as 
X increases. The value of f”(x), being the derivative of f'(x), thus 
indicates the rate of change of /'(a;), i.e. the rate of change of the rate 
of change of /{x), as x increases. For example, the function x* in- 
creases at an increasing rate as x increases and this “ acceleration in 
the value of x^ is measured by the second derivative of the function. 
The value of /"(a;), it should be noticed, must be expressed in so 
many imits off\x) per unit of x, i.e. in so many units off{x) per unit 
of X, per unit of a;. As in the case of the first derivative, a change in 
the units of measurement necessitate a change in the value of the 
I second derivative. 

' In diagrammatic terms, the value of /"(a:) at any point measures 
the tangent gradient of the derived curve y=f'{x) at the point. A 
more useful interpretation, however, connects the second derivative 
with the curvature of the original curve i/=zf{x)y curvature being 
defined by means of the rate of change of the tangent gradient as we 
move along the curve. This connection will be investigated at a 
somewhat later stage (see 8.3 below). 

6.7 The a pplicati on of derivatives in the natural sciences. 

In the interpret^on and explanation oi scienffiio phenomena, we 
attempt to throw into prominence the causal or other relations 
between the concrete quantities or attributes with which we deal. 
Mathematics is designed as a tool to aid us in this attempt. Now, a 
glance at the problems of the natural or social sciences shows that 
they are largely, but not exclusively, concerned with the question of 
change, with the way in which the variable quantities change together 
or in opposition. The practical method of expressing change is 
by means of the average rate of change of one quantity as another 
quantity varies. Hence, when we come to express any assumed con- 
nection between quantities by means of a functional and mathe- 
matical relation between variable numbers, we fiifd that cfur main 
concern is with the average rates of change of these functions. 

These remarks apply even if discontinuous variables and functions 
are used to represent the necessarily discontinuous quantities of 
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scientific phenomena. But it is found mathematically convenient 
to assume, wherever possible, that the variables and functions are 
continuous and then to deal with instantaneous rates of change as 
the limiting and approximate values of average rates of change. 
Sometimes it is necessary to retain discontinuous variables, but the 
theory then becomes mathematically much more difficult and much 
less elegant. We are led, therefore, to the study of the derivatives 
of continuous functions and their use in the interpretation of the 
scientific phenomena under consideration. 

The differential calculus is thus of almost immediate application 
in all scientific studies. Any relation between variable quantities is 
represented by a function, and the derivative of the function measures 
the instantaneous rate of growth of one quantity with respect to the 
other and approximates to the actual average rate of growth when 
the variation considered is small. 

Examples of the use of derivatives can be provided from most 
scientific theories. In the science of physics, derivatives are em- 
ployed to express physical movement of any kind. As a simple 
example, consider a train moving along a railroad track. The dis- 
tance travelled by the train from a fixed point depends on the time 
that has elapsed, i.e. the distance travelled is a function of the 
time-interval.* All that can be actually observed about the train’s 
motion is its average speed (over some time-interval, long or short) 
and this can be connected with the “ incrementary ratio ’’ of the 
distance function. But, from the observable concept of average 
speed, we derive the limiting concept of the velocity of the train at 
a certain time. Velocity is simply the limiting value of average 
speed as the time-interval of the latter is made shorter. Hence, we 
need the derivative of the distance function to measure the velocity 
of the tram at any moment and to approximate to the average speed 
of the train over any short time-interval from the given moment. 

The importance of the limiting idea involved in the derivative is 
seen by the way in which we have come to express physical motion. 
Velocities are not “ natural ” concepts ; they can be deyved only 
by a process of abstraction, of finding limiting values. When we say 

• For example, if a; is the distance travelled in time then x=at represents 
motion at a uniform speed and x—at* motion at a uniform acceleration, a 
being a constant in each case. , 
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that a train is travelling at 60 m.p.h., we are expressing a rather 
complicated idea. We do noF mean that the train will actually 
travel 60 miles during the following hour, one mile during the next 
minute or even 88 feet during the next second! The speed of the 
train wiU generally change during any time-interval however short. 
The statement means that the distance travelled by the train per 
unit of time will correspond more and more closely to 60 m.p.h., or 
any equivalent, as the time-interval is made shorter and shorter In 
physics, therefore, we have accepted the mathematically significant 
limiting concept of velocity as replacing the practical concept of 
average speed. 

The seco nd de rivative of distance travelled as a function of time 
is the derivative of the first derivative and so measures the rate of 
change of velocity over time. The rate of change of velocity is 
commonly described as the occeleratiQ n and this is again a limiting 
concept, the limiting rate at which velocity is changing. Accelera- 
tion is, in fact, another concept to which we have grown accustomed 
in the description of movement and its abstract natiu^ has ceased to 
be a difficulty. 

The units of measurement of both distance and time must appear 
in the derivatives which measure velocity and acceleration, a ‘fact 
which is clear from the way in which velocities and accelerations are 
specified. We say, for example, that the velocity of a train is 44 feet 
per second (30 m.p.h.) and that its acceleration is 2 feet per second 
per second at a given moment. This means that the train is tending 
to cover distance at a rate proportional to 44 feet in one second and 
that its velocity is tending to increase at a rate equivalent to 2 addi- 
tional feet per second in a second. In the same way, when we say 
that the constant acceleration of a body moving xmder gravity is 
32 feet per second per second, we imply that the velocity of the body 
is tending to increase always by 32 feet per second for every second 
that elapses. 

In other branches of mechanics or physics, the derivatives of 
functions are used to interpret motion, whether the motion is of 
liquids in hydro-mechanics, of air and gases in acoustics ‘and the 
theory of sound, of electric currents in electro-mechanics or of heat 
in thermo-mechanics. Further, in a less “ exact ” science such as 
biology, derivatives ^again have their uses. For example, t])o 



152 mathematical ANALYSIS FOR ECONOMISTS 

changes in the proportion of dominant to recessive genes through 
successive generations can be interpreted, in a broad view, as the 
derivative of the proportion as a function of time.* 

6.8 The application of derivatives in economic theory. 

Finally, we come to economics as the most “ exact of the social 
sciences. We have seen that a relation assumed between economic 
variables, e.g. the relation between demand and price, can be ex- 
pressed by means of a fimction and curve. It is usually assumed 
that the fimction and curve are continuous and smooth, i.e, the 
function has a derivative and the curve a tangent at all points. The 
derivative of the function and the tangent gradient of the curve must 
now be considered in some detail and their economic meaning made 
clear. 

We are accustomed, in economics, to describe the variation of one 
quantity Y with respect to another quantity X by means of two 
concepts, an average concept and a marginal concept. The average 
concept expresses the variation of Y over a whole range of values of 
X, usually the range from zero up to a certain selected value. Thus 
average cost is the relation of total cost to the whole of the output 
concerned. Marginal concepts, on the other hand, concern the 
variation of Y “ on the margin i.e. for very small variations of X 
from a given value. So the marginal cost at a certain level of output is 
the change in cost that results when output is increased by a very small 
um’t amount from this level. It is clear that a marginal concept is 
only precise when it is considered in the limiting sense, as the varia- 
tions in X are made smaller and smaller. It is then to be interpreted 
by means of the derivative of the function which relates X and Y. 
Average and marginal concepts are not new or peculiar to economics. 
In describing the motion of a train, the average concept is the 
average speed of the train over (e.g.) its first hour’s run while the 
marginal concept is the velocity of the train at the end (e.g.) of the 
hour’s run. 

Some actual examples can be taken to illustrate the average and 
marginal concepts. We have seen that the demand for a good 
can be exp’ressed, under certain conditions, by the relation p = 0 (x), 
giymg. price as a continuous and decreasing function of the demand.^ 
^ See Haldane, The Oattsee of Evolution (^1932), Appendix. 
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But the price p can be regarded as the average revenue obtained from 
the demand i.e. total revenue divided by the amount demanded 
or produced. The ordinary demand function and curve are thus 
equivalent to the average revenue function and curve. Total revenue 
R=xtlt{x) is also a continuous function of demand. From the total 
revenue curve, we read off average revenue (or price) as the gradient 
of the line OP joining 0 to the appropriate point P on the curve. 

If output is increased by a small amount Ax from a certain level 
suppose that total revenue is found to change by an amount AR. 
There is an increase or a decrease in revenue according to the sign 
of AR. The added revenue per unit of added output is then the 
ratio of AR to Ax, i.e. the ‘‘ average revenue ” for outputs from 
a; to a: -f Ja;. As the output change becomes smaller, we obtain the 
rate of change of revenue on the margin of the output x as the limit 
AR 

of ^ as Ax-^Q. This is termed the marginal revenue at the output x 

and it is measured by the derivative of as a function of x : 

Marginal revenue I ’ ^ / 

Marginal revenue is thus an abstract concept only definable for 
continuous variations in revenue and output. But it is always 
approximately equal to the added revenue obtained from a small unit 
increase in output from the level a;.* In diagrammatic terms, the 
marginal revenue at an output x is measured by the tangent gradient 
to the total revenue curve at the point with abscissa x. 

Marginal revenue is itself a function of the output x, its value 
depending on the particular margin of output considered. We can 
thus draw the marginal revenue curve to show the variation in 
margmal revenue as output increases. As in the cases of the total 
and average revenue curves, the outputs are measured along the 
horizontal axis. Since margmal revenue is the tangent gradient of 
the total revenue curve, the form of the marginal revenue curve can 
be deduced at once from that of the total revenue curve. 

♦ The term “marginal increment of revenue which is sometimes used, is 
Dot to be 'Confused with “ marginal revenue **. The former is the increment 
AR in revenue resulting from any increase Ax in output fr^m the levfel x. Now 

dR * 

AR = — Ax approximately when Ax is small. Hence the marginal increment 

of revenue for a small increase in output is approximately equal to the mar- 
ginal revenue times the ^crement in output* 
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Two particular types of demand law can be taken for purposes 

of illustration. The linear demand law p =a -bx gives 

— 

R —ax - bx* and -j- =a- 2bx (see 6.3, Ex. 2). 

_ . dix 

! The total revenue curve is a parabola with axis pointing vertically 
downwards. The average and marginal revenue curves are straight 
hnes sloping downwards and the gradient (referred to Ox) of the 
latter is twice that of the former. Fig. 44 shows the three curves in 
the particular case where the demand law for sugar is p — ld-ix 
(see 6.3 above). Here, as in all cases, it is foimd convenient to plot 



Fio. 44. 


the average revenue (or demand) curve on the same graph, and 
referred to the same axes and scales, as the marginal revenue curve. 
It is to be noticed that the marginal revenue curve cuts the axis Ox 
(marginal revenue zero) at the same output at which total revenue 
is greatest. This is a general property which follows since, at this 
output, the tangent to the total revenue curve is horizontal with 
zero gradient. 

From the demand law p = — ^ -c, we derive 
^ x-hb 

n ^ dR ah , ^ . V 

= (se« 7.4, Eit. 15). 


In Fig. 45 are shown the total, average and marginal revenue curves 
obtained in the particular case where the demand law for sugar is 

p = - f> (see^ 6.3 above). Again the marginal revenue curve 

X -hJO 


falls continuously, lies under the average revenue (or demand) curve 
and cuts Ox at the output where the total revenue is greatest. 
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The three concepts of total, average and marginal cost can be 
defined, under the conditions of the cost problem of 5.5 above, in 
a very similar way. The total and average cost functions and 
curves have been sufficiently described already, average cost being 
read off the total cost curve as the gradient of the line OP joining 0 
to the point P with the appropriate abscissa on the total cost curve. 



If output is increased by an amount Ax from a certain level x and if 
the corresponding increase in cost is d/7, then the increase in cost per 


unit mcrease m output is . 


Marginal cost is defined as the Umiting 


I value of this ratio as Ax gets smaller, i.e. marginal cost is the deriva- 
I tive of the total cost function n=F(x), It measures the rate of 
increase of total cost and approximates to the cost of a small addi- 
tional unit of output from the given level. Further, the marginal 
cost of any output is given as the tangent gradient of the total cost 
Curve at the appropriate point. Since marginal cost varies with the 
output at which it is measured, we have a marginal cost function and 
a corresponding marginal cost curve.* 

If the total cost function is 77=aa;* +bz -i-c, then 

Average cost tt = oa; + 6 -I- ~ and Marginal cost ^ = 2ax + 5. 


The total cost curve is the rising portion of a parabola with axis 
pointing vertically upwards. The average cost curve is found to be 
U-shaped and the marginal cost curve is a straight link sloping 


♦ Marginal cost is described as, or equated to, “ marginal supply price ” 
by some writers. See, for example, Bowley, The Mathematical Orou^work 
of Economics (1924), p. 34. 
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! upwards with gradient 2a. The three curves are shown in Fig. 46 in 
the particular case where 77 +200 is the cost relation for 
a sugar refinery (see 6.6 above). It is to be noticed that the marginal 
cost line passes through the lowest point of the average cost curve 
at an output OM where the tangent to the total cost curve goes 
through the origin. At such an output, average cost is smallest and 
equal to marginal cost. 



Average and marginal concepts can be usefully defined in other 
economic problems. The transformation problems of 6.6 above 
provide examples. If P is a point on the transformation curve for 
the production of two goods X and Y with given resources, then the 
co-ordinates of P represent a possible production. The gradient of 
OP (referred to Ox) is the average output of Y per unit output of X, 
the amount of Y produced as a proportion of the amount of X 
produced. The tangent gradient of the transformation curve at P 
(referred to Ox), i.e. the derivative of y with respect to x, is negative 
and its numerical value measures the marginal rate of substitution 
of Y production for X production, the rate at which the production 
of F is increased when the production of X is decreased. In the same 
way, the tangent gradient of the transformation curve for incomes 
in two years is used to define what Professor Irving Fisher calls 
the marginal rate of return over cost.* Marginal concepts are also 
fundamental in the use of indifference curves for choice between two 
goods or two incomes. The “ marginal rate of substitution ” of one 
good for another in consumption is defined as the numerical value 

♦ See 9.7 below and Examples VI, 31, 
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tl^tangent gra<iient^at the appropriate point on an indiflference 
curve for the two goods. Again, the numerical value of the tangent 
gradient of an indifference curve for two incomes is used to define 
the “ marginal rate of time preference for present over future 
income in the case of the individual concerned. These concepts will 
be considered in detail at a later stage. 


EXAMPLES VI 


Evaluation and interpretation of derivatives ♦ 


1. Find, from the definition, the derivatives of a?* and (1 +a;)*. 

2. Show that the derivative of a constant is zero and that the derivative 
of X is unity for any value of x. Interpret these results in terms of rates of 
change. 

3. Use the definition of the derivative to establish the general result that 
the derivative of (ax*+6a;4-c) is (2aa;+6). 


4. Evaluate ^[x{ \ -{-x)) and ~{x{\ -x)} from the definition. Show that 

CLX CL'JO 

these derivatives are respectively the sum and the difference of the derivatives 
of X and «*. Does this suggest a general rule? 


5. From the definition, show that 
verify the derivative of ^ • 

X - I 


d /aia?+6A 
dx \a^ + 6,/ 


afit - ajbi 

(a^ + 6 ,)* 


and hence 


0, Show that the derivative of a step-function is zero except at certain 
points where it does not exist at all. Illustrate the statement that a function 
has no derivative at a point of discontinuity. 


7. Find the average rate of increase of x* when x increases from 1 to 1*1. 
What percentage error is involved when the derivative of a;* at a; = 1 is used to 
represent this average rate of increase? 


^8. Find the average rate of decrease of 


2a? .-fl 
a? - 1 


as a? increases from 2 to each 


of the values 3, 2*6, 2-1, 2 06 and 2 01. Find, from the derivative, the in- 
stantaneous rate of decrease at a? = 2 emd compare with each of the average 
rates. 

y \ X 1 

Evaluate the derivatives of — emd Show that — =3-2a5 and 

^ a?* a? + 1 X* 

= i(a; + 1) approximately when x is nearly equal to 1. What percentage 


a? + 1 

errors are involved in the use of these approximations when a: = 1*1 ? 


10. If a: is small, show that /(a?) =/(0) +a:/'(0) approximately. Find the 
derivative' of aJifi show that this expression can be represented approxi- 
mately by (1 -a?) when a? is small. Express the difierenoe between the ex- 
pression and its approximation in terms of x. 


♦ Further examples on the interpretation of derivatives are given in 
Examples VII, 24-37 be]ow. 
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11. Show that /(a -fa:) =f(a) -\-xf'{a) approximately when x is small, 
the derivatives of x* and — , show that 

X* 


(l-fx)* = l+3x 


and 


1 

(2a: + 3)* 


1 4 

“ ■“ _ ^ X 

9 27 


From 


approximately when x is small, 
out as a cubic expression in x. 
*= 0 - 6 ? 


Check the first result by multiplying (1 +x)* 
How good is the second approximation when 


12. Use the derivative of x* to show that the approximate increase in the 
area of a circle is 2rra/i square inches when the radius is increased by a small 
amount h from the value a. Calculate the actual increase in area when the 
radius increases from 10 to 10- 1 inches and compare with the approximate 
value given by the above formula. 

13. A circular ink -blot grows at the rate of 2 square inches per second. 

Show that the radius is increasing at the rate of — inches per second at the 
time when its length is r inches. ^ 

14. The surface area and volume of a sphere of radius r are S = and 
V =f7rr*. Find approximate expressions for the increases AS and AF when 
the radius is increased from r by an amoxmt Ar. Deduce that the propor- 
tionate increase in S is approximately twice, and in V approximately three 
times, the proportionate increase in radius. 

15. The pressure of a gas at constant temperature is p lbs. per square inch 
and the volume v cubic inches where pv = 10. Find the rate of decrease of 
volume as pressure increases and the approximate decrease in volume when 
the pressure is increased from p by an amount Ap, 

16. At what point on the parabola y=x* does the tangent make equal 
angles with the axes? Use the derivative of (1 -f x)* to obtain the equation of 
the tangent at (0, 1) on the parabola t/ =x* + 2x -f 1. 

17. What is the equation of the tangent to the parabola y =x* -f 3x -2 at 
the point with abscissa x = a7 Show that there is one point on the parabola 
with tangent parallel to Ox. What is this point? 


18. From the derivative of (ox* -f 6x -f c), show that the tangent at the point 
(Xj, j/i) on the parabola y =ax* -f 6x -f c has equation 

y -f t/i =2axxi -f 6(x +Xi) +2c. 

19. Show that the tangent at the point (1, 1) on the rectangular hyperbola 
xy = 1 cuts equal lengths ofi the axes. 


20. Show that xy^ -f yxj =2a* is the tangent at the point P{Xi, yi) on the 
rectangulcu* hyperbola xy = a*. If the tangent cuts the axes in A and B and 
if PM and PN are perpendicular to the axes, show that M bisects OA 
and N bisects OB, Show also that the area of the triangle OAB is a constcmt 
independent of the position of P on the curve. ^ 


21. From the derivative of 


2x -f 1 
x~l 


find the equation of the tangent at the 


point; (Xj, y^) on the rectangular hyperbola (x - l)(y - 2) =3. Show that the 
tangent gradient tends to become infinite as Xj-^l. What is the relation of 
this fact to the vertical asymptote of the curve? , 
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22. Find the derivatives of ax and arc*, where a is a constant. 

A ball rolls at feet in t seconds. Show that the average speed over any time> 
interval and the velocity at any time are both constant. 

A lead pellet falls at^ feet in t seconds. What is the average speed between 
the ijth and the f,th seconds? Find the velocity after t seconds. How good 
an approximation is the velocity after 3 seconds to the average speed between 
times t=r3 and t = 3-6 seconds? 

23. Show that the acceleration of the lead pellet of the previous example 
is constant over time. 

24. What is the derivative of (1 +2rc)? Show that the electric current in a 
wire is constant if (1 +2^) imits of electricity pass a point in the wire in t 
seconds. 

25. Draw the total revenue curve on one graph and the demand and 
marginal revenue curves on another graph in the case of the demand for 
gramophones of Example V, 2. Verify that the marginal revenue is zero at 
the output giving greatest total revenue. 

26. If the market demand for tea is given by the law of Examples V, 1, 
draw the total revenue curve and read off the tangent gradient to give the 
marginal revenue at demands of 60, 60 and 70 thousand lbs. per week. Check 
these values from the expression for marginal revenue. Draw on one graph 
the demand and marginal revenue lines. At what price does marginal revenue 
vanish ? 

27. Evaluate the derivative of (ax+bx*). If 100a: passengers travel on the 
train of Examples V, 11, find an expression for the marginal revenue derived 
by the railway company. Draw the marginal revenue curve, showing that it 
is a part of a parabola falling to the right. Find the fare at which mar^ginal 
revenue is zero and verify that this fare produces the greatest total revenue. 

28. A firm produces radio sets according to the conditions of Examples V, 
16. Draw two graphs, one showing the total cost curve and the other the 
average and marginal cost curves. Verify that average and marginal cost are 
equal when the former assumes its least value. 

29. If the cost of a coal merchant’s supply of coal is of the form described 
in Examples V, 16, show that the marginal cost decreases steadily as the 
monthly purchase increases. 

30. From the derivative of (ax^ -hbx* -hex), find the marginal cost of any 
output in the case of the firm of Examples V, 18. Show that the marginal cost 
curve is a parabola and locate the output for least marginal cost. Draw the 
average and marginal cost curves on one graph and show that the output for 
least average cost is greater than the output for least marginal cost. 

31. A business produces an income of £x this year and £t/ next year, where 

these values can be varied according to the relation y = 1000 - * Explain 

how ~ interpreted as the marginal rate of return over 

cost. Show that the value of this marginal rate is wh^n this year’s 

income is £x, 



CHAPTER Vn 

THE TECHNIQUE OF DERIVATION 
7.1 Introduction. 

The first object of the differential calculus is to systematise the 
process of finding the derivatives of functions, to make the writing 
down of a derivative an almost mechanical matter. Text-books on 
the subject necessarily devote a considerable amount of space to 
this stage, but it must be remembered that the systematic evaluation 
of derivatives is not an end in itself. We evaluate derivatives only 
because they are useful to us in the application of mathematical 
methods in the natural or social sciences. These applications would 
be extremely laborious were it not for the technique of the differential 
calculus ; we should find ourselves repeating over and over again the 
same' kind of algebraic process, the process of finding the limits of 
certain expressions, whenever we need the derivatives of particular 
functions. We have, therefore, to learn to evaluate derivatives easily 
before we can apply them fruitfully in economics or any other scien- 
tific study. 

The systematised technique of derivation involves two steps. The 
first consists of the evaluation (from the definition) of the derivatives 
of the simplest functions, the results, which are called “ standard 
forms ”, being set out in tabular form and memorised. The table 
of standard forms, once obtained, is taken for granted and the 
derivatives it contains simply quoted whenever they are required. 
All the functions considered in the present chapter are actually 
derivable from one simple function, the “ power ” function. Our 
table of standard forms, therefore, need contain only a single entry 
at the moment, i.e. the derivative of the simplest power function. 
Many other'’ fimctions, such as the trigonometric, exponential and 
logarithmic functions, are used in mathematical analysis and some 
of these will be introduced at a later stage. It will then be necessary 
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to extend the table of standard forms by evaluating the derivatives 
of the simplest examples of the new functions. 

The second step is to frame a set of rules which serve for the 
derivation of more complicated functions. A given function, no 
matter how involved, is reduced to a combination of simpler functions 
the derivatives of which appear in the table of standard forms. The 
rules for derivation simply tell us how the derivative of a combination 
of this kind can be obtained in terms of the simpler standard form 
derivatives. The combinations may involve sums, differences, pro- 
ducts or quotients. But, as we shall see, they may also include a 
very different kind of configuration. In all cases, the rules for deriva- 
tion are relatively simple. With the table of standard forms and 
sufficient skiU in the manipulation of the rules, we can set about 
evaluating derivatives with the greatest confidence. This confidence 
should be acquired in the course of the present chapter. 


jf 7.2 The power function and its derivative. 

In elementary algebra we deal with expressions of the power 
type a", where n is called the index (or exponent) of the given base o. 
The meaning of the power varies, however, with the nature of the 


number n. 


For example, a*=a xa, ai = Ja, and = 

^ V a* 1 


In general terms, the power a" is to be interpreted as follows : 

(1) If n is a positive integer, a” denotes the result of multiplying 
a by itself n times. 

(2) If n is a positive fractional value, a" denotes the positive value 


r 

of a certain root : a« = V a^. 

(3) If n is a negative integral or fractional value, a" denotes the 


reciprocal of the corresponding positive power: a « = 


'(4) If n is zero, a** stands for unity : a® = l. 

(5) If n is an irrational number, the power is more complicated 
in meaning. We can write n, in this case, as the limit of a sequence 
of integral or fractional numbers r^, Wj, n„ ... . T^e power ^a** is then 
defined as the limit of a"f as r~>Qo . 

In all cases, the power a** obeys the familiar “ index laws ” developed 
in elementary algebra. ^ 



162 


MATHEMATICAL ANALYSIS FOR ECONOMISTS 


One of the simplest of all functions is the power function 
where the index n is a fixed number. The variable x takes values 
continuously from the whole set of numbers except that only positive 
values can be considered in certain cases (such as y — Jx), A dis- 
tinctive characteristic of the power function is that both the function 
and its inverse are of the same type. For example, the power func- 
tion y=x^ gives the inverse function x = Jy, which is a power function 
of y. 

The derivative of a;", for any value of n, is now required as a 
standard form. A number of simple cases can be taken first i 


d / . T * 


A 

dx 


{x^) =Lim 


{x +hY-x^ 


=Lim 

A-M) 


2hx 


=Lim (2a; -*-A) = 2aj, 

A-i-O 


/ T • +A)* - a;* . ihx^ +3^2^ -f 

j {^ ) =Lun ^ { =Lim 7 

dx " ' A A 


/ A 


A-»^ 

=Lim (3a:* -h3Aa; +A^) = 3a;*, 
A-»o 


^/1 
dx 


(-)=Lim i(-^r-)=Liin — JL^= -i, 

\xj A^A\a;+A xj a:(x-fA) a:* 


i.e. 


sp)”-*"*- 


If 1 11 x*-{x-\-h)' 

dx \xy - ™ A\(x +A1* a:*/ “ 1“ hx^{x +A)» 


=:Lim 


-2x-h 


i.e. 


Ho »*(*+*)*" .*»’ 

fUx)=Lim 

^ * A-+0 h(^x +h +Jx) 


T . (x+h)-x 1 1 

k-*o h\-Jx +h + ^x) h-tO\/x+h+^x 2^® 


i.e. 


d 


y 


-{xi) = ix-K 


A uniform law is now apparent. In each of the above cases the 
derivative of *" is found to be of the form This is, in fact, 
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true for any given value of n whatever and, to provide a general 
proof, we need a result not established here. The result can be ex- 
pressed quite simply : 


(1 +a:)" = l +na; +Ax*, ^ 

where A is some quantity involving x which is finite even when x-*0. 


Then ;^(a:") = Lim — — 

dx' A 


=Lim ^ 
h -*0 ft 


, * . 
1 +71 — + A 

X 




= Lim (- +A-^)=a:'*-: 
h-^Q \x x^J X 


-nx 


n-l 


Hence, we have the standard form : 


a 

— (x") for any fixed value of n. 


This result can now be quoted whenever required. For example, 
we are entitled to write at once, not only the particular derivatives 
evaluated above, but also many others such as 




d 

dx 



- 3x-« = - 


3 , 

X*’ 


dx 


(Ix^) 




A 

dx 





2 

1 


Many rather different kinds of derivatives are thus included in the 
one standard form. The method of treating fractional and negative 
indices is to be noticed. 


7.3 Rules for the evaluation of derivatives. 

We come now fo^the second stage in the technique of derivation, 

to the framing of a set of rules for the derivation of combinations of 

• ° 

functions. The first three rules relate to combimitions which are 
sums (or differences), products and quotients respectively. In each 
of these rules, as set out below, it is assumed that u and v are two 
given functions of x with known derivatives at the point concerned. 
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Rule I. The derivative of a sum or difference. ^ 

The derivative of the sum (or difference) of two functions is the 
sum (or difference) of the separate derivatives : 


(u +t;) : 


dx^ dx 


d . du dv 

dx^^ t;)— 


Ritle II. The derivative of a product. 

The derivative of the product of two functions is equal to the 
first function times the derivative of the second plus the second 
function times the derivative of the first : 

d , . dv du 

Rule III. The derivative of a quotient. 

The derivative of the quotient of two functions is equal to the 
denominator times the derivative of the numerator minus the 
numerator times the derivative of the denominator, all divided by 
the square of the denominator : 

du dv 
d /uN y dx ^ dx 
dx\v) v^ 


A -1 ^ 1 

As a* special case : — I - I = — - — . 

^ dx \u/ u^dx 

Formal proofs of the three rules proceed directly from the defini- 
tion of the derivative. The proof of the first rule is : 

If u = ^(a;) and v = iIj{x) are the two given functions, then 

— («+t;)=Lim +^)} - 

dx' ^ A_»<, h 

=Lim •l>{x+h)-<p{x) 

h-»0 * *-M) A 

_du dv 
dx dx* 

and similarly for the difference of u and v. We use here the fact 
(see 4.5 above) that the limit of a sum (or difference) is the sum (or 
difference) of the separate limits. The proofs of the other two rules, 
though more complicated, can be given in essentially the sa\ne way.* 
These rules. can also be deduced from the first rule by a method to 
be described at a later stage (see 10.2 below). 

* See, for example. Hardy, Pure MathemaHcs, (3rd Ed., 1921), pp. 203-4. 
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Important deductions from Rules I and II concern the behavior 
of constants in the process of derivation. A constant can'4)e 
regarded as the (limiting) case of a function of x which does not 
change m value as x varies. A constant must thus have a zero 
rate of change, i.e. the derivative o f a constant is zero. Rules I and 
II now give 


d , . du 


and 




du 

dx^ 


where u is any function of x and a is any constant. Hence, an 
additive constant disappears when the derivative is taken and a 
multiplicative constant remains unaffected by derivation. These 
facts are of very great service, as the examples of the following 
section show. 

The three rules are set out above for the case of a combination of 
two functions only. But, if more than two functions appear in a 
combination, the rules can be applied several times in succes- 
sion to give the derivative. It can be noticed, however, that the 
sum and difference rule extends at once to give the derivative 
of the ‘‘ algebraic ” sum of a number of functions as the similar 
‘‘ algebraic ” sum of the derivatives of the separate functions. For 

example, , j j j 

^ d , ^ du dv dw 

-{u+v-w)=^+^-^. 


where w, v and w are three functions of x with known derivatives. 

The derivative of a product or quotient involving more than two 
functions is less easily obtained, as is shown by the following example: 
Taking the product {uvw) as (uv) times w, we obtain 


d^ 

dx 


(UT^) —uv 


dw 

dx 


d , . dw 


( 


dv 

u— -fv 
dx 


du\ 




(X^ 


du dv dw 
=^vw Xuw - 7 - Xuv ^ 
dx dx dx 




In an exactly similar way, we can show that^ 




dx 



du dv 

vw-r- -\-uw^ — uv 
dx dx 


dw 


)-■ 


I . 


It is possible to put the product and quotient rules in a someT^hat 
different form suitable for the extension to the case of more than 
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two functions. The product rule for the derivative of the function 
y=uv can be divided through hy y^uv and written 

\dy ^\du \dv 
ydx udx vdx* 

In the same way, the quotient rule for y = “ can be arranged 

Idy _\du \dv 
ydx udx vdx 

Further, the derivative of y^uvw obtained above becomes 

Idy \du \dv \ dw 
ydx udx vdx w dx* 
uv 

and the derivative of v = — becomes 

w 

Idy _^\du \dv \ dw 
ydx u dx vdx w dx 

We now have a common form of the product and quotient rules 
which can be extended, in an obvious way, to the case where any 
number of functions are involved. The meaning of this alternative 
expression of the rules will be made more clear later (see 10.2 below), 


7.4 Examples of the evaluation of derivatives. 

The examples below illustrate the way in which the rules for 
derivation are used in practice. The derivative of a fairly compli- 
cated function is to be obtained by several applications either of the 
same rule or of different rules. The method of dealing with constants 
in the derivation process is particularly to be noticed. It is also 
clear that some derivatives can be obtained in two or more different 
ways. The result is always the same and all that matters is to get 
it in some way or other. 

E..1. 


Ex. 2. 
Ex. 3. 


£ 

dx 


©■ 


dx\x. 




1 
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Ex. 6. + 3z-2) =2* + 3, 

E..6. 


Ex. 7. For any fixed values of the constants a, 6, c, , we have 

d d 

■^(ax + b) =a; -^{ax^ +bx +c) = 2ax +6; 

d 

^ (ax^ + bx^ + ca; + d) = 3aa;^ + 2bx + c ; and so on. 

Ex. 8. ^ {a:(l +a:^)} =x ^(1 +a;^) + (1 +z^) ^(•^) =a:(2a:) + (1 +a:*) 

= l+3a;^ 

E,.9. +:>)(! +«l 

= (1 +x)^~{l +x) + (1 +ar) =2(1 +x), 

r ^(1 +a;)*=-^(l +2a; + x2) = 2^(a:)+-^(x*)=2(I +*). . 

ax dx dx dx 

Ex. 10. |^{x(l+x)(l+2x)}=x(l+a:)^(l+2x)+(l+2x)^{x(l+x)) 

= 2x(l+x) + (l +2x)(l+2x) 

= 2x + 2x* + 1 +4x +4x* = 1 +6x +6x*, '' 

r ^ {x(l + x) (1 + 2x)} =^{x + 3x^ + 2x3) = 1 + 6a: + 6x». 


Ex. 11. 


Ex. 12. 


V3a: + 2/ 


-^^(3x.2) 


dx\3x+2/ (3x + 2)3 


3 

(3x + 2)3’ 


dx\ X J x* 


x(4x) -{2 x3h.1) 2x3-1 




1 2x3-1 


or 



168 


MATHEMATICAL ANALYSIS FOR ECONOMISTS 

d / 1 +a; \ 

-2x) 


(^l-2x)Ul+x)-{\+x)U\-2x) 

Ex. 13. ^ 


(1 - 2a;)* 

(l-2a;)+2(l+a;) 3 


Ex. 14. 


Ex. 16. 


(l-2a:)* (l-2a:)*‘ 

±[ ? ] 

dx\{x-\-l)(x+2)j 

_{x-^l){x-\-2) -a:(x4-l +x + 2) x^ -2 

(a; + 1)2 (a; + 2)2 “ " (a; + 1)2 (a; + 2)* ‘ 

d f ax 
dx\x + b 




(a: + 6) -X 
(a: + 6)2 


ah 

(x + by 


-c. 


Ex. 16. ^ {xf(x)} =x^f(x) +fix) ^{x) =xf'(x) +f{x), 

L _ xf'(x) -fix) 

dx\ X X* X* 

where / (a?) is any single- valued function of a?. 


7.5 The function of a function rule. 

The three rules we have given are not in themselves sufficient for 
the derivation of aU the functions met with in ordinary mathe- 
matical analysis. On the contrary, there are many relatively simple 
fimctions the derivatives of which cannot be found by means of the 
three rules. Consider, for example, the function 

y = j2x^-3, 

which is a “ mixture ” of the simple quadratic function (2a;2 - 3) and 
the simple square root function. On attempting to evaluate the 
derivative, however, it is found that the function does not break 
up into sums, differences, products or quotients of “ standard form ** 
functions. The “ mixture ”, in fact, is not as before. We have a 
function which is fundamentally different in nature and* we need 
some method of evaluating its derivative. 

The problem is solved by introducing a new variable z 

u=:2a;*-3. 
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Here, ti is a function of x and its derivative is 


du 

dx 


dx 


{2x^-3)=4x. 


The original function now appears as y = JUy i.e. as a simple function 
of the variable u with a known derivative with respect to ui 


du 




1 

2Ju 


The function y = ^2x^ - 3 has been arranged, therefore, in such a 
way that we can say that y is a function of u, where is a function 
of X, Both these functions have known derivatives. Can we now 
deduce the derivative of y with respect to xi The answer is provided 
by the very simple rule given below. 

The way in which we have arranged the function y = j2x^-3 
shows that it can be described as a “ function of a function of x 
It is clearly only one example of a wide range of such functions to 
which the following rule for derivation applies : 


Rule IV. The derivative of a function of a function. 

If y is a function of u where is a function of a:, then the derivative 
of y with respect to a; is the product of the derivative of y .with 
respect to u and the derivative of u with respect to x : 


dy dy du 
dx du dx 


The formal proof of the rule proceeds : 

If y=f{u), where u = (f){x), then y=f{<f>{x)} and 


dx A_>o h 


Let k = ^{x-\-h) - ^{x)->0y as A->0. 

So /(“ + -/(^) _ <f>{x + h)-<l>{x) 

dx A_>o k h 




=Lim 

*-♦0 


f(u + k)-f{u) 
k 


Linj <l>{x+h)-^{x) dy 
A -+0 \ du 


du 

dx 


We use here the result (see 4.5 above) that the limit of^a product is 
the product of the separate limits. 

The rule clearly extends to the case where there are several “ inter- 



170 


^L\THEMATICAL ANALYSIS FOR ECONOMISTS 


mediate functions. For example, if y is a function of u where u 
is a function of v which, in its turn, is a function of a?, then 

dy _dy du dv 
dx~~du dv dx 


This follows from two successive applications of the simple fimction 
of a function rule given above. 

The success of the function of a function rule in practice depends 
largely on the introduction of an “ intermediate ” function u in such 
a way that the derivative is most conveniently obtained. The 
method of breaking up functions for the application of the rule is 
illustrated by the following examples 


Ex. 1. y ■■N/2rr* - 3, ^ 

Write y where u - - 3. 


and 80 ~ J2x* - 3 *=* :r ^ - 3) « 

dx du^ dx 


1 

'2Ju' 


4rX > 


2x 


J 2x^-3* 

Since u is only introduced for convenience of working, the final step 
is to get rid of this “ intermediate ” variable in terms of x. 


Ex. 

i.e. y = where m = 1 + a;. 


This result has been obtained before (7.4, Ex. 9) by other methods. 


Ex. 3. y = 


1 

3x + 2’ 


i.e. y = - where i4 = 3a; + 2. 

u 

5x(3*T2)“5m(«) ~(3a:+2)** 

which has also been obtained before. 


Ex. 4. y = {ax + b)^, 
i.e. y=u^ where w=aa; + 6. 

So • ^{ax + b) . a*=na(aa: + 6)**“^ 




i.e. * 


y^Ju where 


1 -fa; 



Now 


and 
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So 


1 jl -2x 
2Ju 2*1 +* 

s(ra)-(i4? 

[1+^ 1 IT 
* 1 - 2 * 2*1 


dx 


-2x 


1 


Tx (i - 2x)^^ 2 7(1 + x) ( 1 - 2x)^ * 

Each of these five examples is an instance of one general form 
which can be derived at once from the function of a function rule : 

d , , ^du 

where u is any given function of z. In the case n = - 1, we have 


_d 

dz 


(l\ _}_du 

\u) ~ u^dx* 


wjl|jch is the special case of the quotient rule already given. 


7.6 The inverse function rule. 

From the function of a function rule we can derive the fifth, and 
last, of the rules for derivation. This rule provides the derivative 
of the function inverse to a given function and it applies only if both 
the inverse functions are single-valued. 

Rule V. The derivative of an inverse function. 

The derivative of an inverse function is the reciprocal of the 
derivative of the original function : 

dx 1 
dy~dy" 
dx 

provided that both functions are single-valued. 

The proof is as follows. Suppose that the single- valued function 
y^(f){x) gives the single-valued inverse function x=^\ls{y). Then 
^{<l>{x)} must equal x for any value of x. So 

(by^ulelVy 

dx dy 


But 
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dx dt/ 

So, the product of -j- and ^ is unity and the rule is proved. 
dy (tx 

As an illiistration of the inverse function rule, we can take the 
case of the derivative of ^x. If y=Jx, then x=y*. The inverse 
function rule then gives 


^ ( j.t)— ^ — J_=—L_ . 

d . „ 2y 2Jx 

This result agrees, of course, with that obtained from the standard 
form giving the derivative of x^ for any value of w. 

The process of evaluating derivatives can now be regarded as 
complete. The five rules, taken in conjunction with a suitable table 
of standard forms, are sufficient for the derivation of all single- 
valued functions, no matter how complicated is their analytical 
expression. Even when functions of entirely new t 3 ^e 8 are intro- 
duced, it is only necessary to extend the table of standard forms 
by including the derivatives of the simplest instances of the 
new function types, as obtained directly from the definition of 
the derivative. Examples of this extension of the table of standard 
forms will be given later. 


7.7 The evaluation of second and higher order derivatives. 

The practical method of obtaining the second and higher order 
derivatives of a function introduces nothing that is new. Having 
obtained the first derivative by means of the rules set out above, 
the second derivative is obtained by a further application of the rules, 
this time applied to the first derivative considered as a function 
of X. The third, fourth and higher derivatives are then obtained in 
succession in the same way. 

It is possible, however, to extend the sum and product rules so 
that they apply directly to second and higher order derivatives. 
If u and V are two single-valued functions of x, we have 



dhi d*v 
dx^ ^ dx^ 
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In general, for any value of r which is a positive integer, 

d' , , d^v 

daj*’ ^ ±v)— 

The extension of the product rule is more difficult. From 
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we have 


da:* 


d , . dv du 

, , d I dv\ d ( du\ 

dx dx) ^ dx \ dx) 

( dH dudv\ fdudv dhi 
^ dx^^ dxdx) ^ \dxdx^^ dx^, 


\dxdx 
dhi 


_ dH dudv 
^dx^^ dxdx^^ dx^ 

The third derivative is then found by a further derivation process as 

/ X dH ^dudH ^dhidv dhi 

A general result for the rth derivative of the product is obtained, in 
this way, by repeated derivation : 


dx 




d^v du d'^-H r (r - 1 ) dhi d^'^H 


dxdx^-^ 


+ 


dx^ dx^-^^ 


r (r - 1 ) d'^~hi dH d’’“% dv d^u ^ 

2 dx^~^ dx^ ^ ^ dx^'~^ dx^^ dx^ 


This is known as Leibniz’s Theorem and the coefficients of the 
successive terms on the right-hand side follow the well-known 
“ binomial ” law. From the point of view of the present develop- 
ment, however, only the first and second derivatives are normally 
required and these can always be obtained by direct application of 
the simple rules given above. A number of examples are given for 
illustration : 


Ex.l. 


dx^ 


(*<) 


= 4a:* ; 
= 24; 






'0. 
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=^> ^ °“- 
=na:"-‘; ^(a:") =»(» - 1)^"-* ; 

d^ 

^ (a;^) = n (n - 1 ) (n - 2)a;'‘~* ; and so on. 

The general result for the rth derivative of this standard form can be 
written down by inspection : 

^(a;") =n(n - l)(n -2) ... (n -r + l)a;«”*‘. 


The previous two examples are particular cases of this standard result. 
If n is a positive integer, as in the first example, then the derivatives after 
the nth order are all zero. If n has any other value, the process of finding 
successive derivatives can be extended indefinitely without producing 
zero derivatives. 


l( 

' ^ ^ 

3 


f 1 > 

1 - • 

dx\ 

3a; + 2/ 

(3a: + 2)*’ 

da;*' 

v3a; -i-2> 

' (3x+2)»’ 

d* 

( ^ 1 

162 

and fi 

d 

0 

0 


dx^ 

\3a; + 2y 

(3a: +2)*’ 



Ex. 5. 

dx 


2x 


V2a:a-3’ 


dx» 


V2a:»-3 = 


2sl2x^-Z-2x^‘j2x^-Z 

dx 

2iF^ 


2 / 2a:* \ 6 

2a:* “7^233/ (V2*^3)*’ 

and similarly for higher order derivatives. 


Ex. 6. If f{x) denotes any single- valued function, then 

* =*/'(*:) +/W. 

W (*')} = ^ (*:)} + ^/(*') = */" (*) + 2/' (*). 


and* 
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Again, 

A. 

dx 

and 

II 


£ {^’} -/w>- W'W -/(*)>£*•)] 

+/'(*) -/'(X)} - 2 x{x/'(x) -/(X)}] 

xY'{x)-2xf'{x)+2f{x) 

" a:^ 


EXAMPLES Vn 
Practical derivation 


1. Write down, from the standard form, the derivatives of Vac*, — : and 

1 ^ — X* / 


1/7* 


2. Obtain the derivatives of l+2a;*~3a;*; (1-x)*; (a? - 2)(2j; + 1) | 
1 ! , 1 Va? 1 X + 1 1 + 2x 

2x + l’ !-»«' (x + 2)(x+3)' i-* + 2x*’ 


3. If a, 6 and c are constants, show that 

^ 

cte Vox + 6^ ~ (ox + b) 
dx 


2ox +6 


4. Show that 


/ 1 g d / 1 ^ 

Vox +6/ (ox +6)* dx Vox* + 6x + c/ (ox*+6x+c)*' 

( OjX + 6i\ _ Oi6, - Oafti 

OjX + 6,/ ~ (OjX + 6|)* * 


6. If /(x) =ax +6, find the derivatives of x/(x) and 


/w. 


1 — 

6. Verify that 1 +x +x* + ... +x" =— ^ . By means of derivative* 
deduce that 


1 +2x + 3x* + ... +nx’*“* = 


1 -X 

1 - x" nx" 

(1 -x)»“r^‘ 


7, It is given that y = 1 +ii* where w = 1 -x*. Find y as a function of x. 
Write down the derivatives of y with respect to u and of y and u with respect 

to*. Verify that 

dx du dx 


8. Find the derivatives of 


Vx + l 


and 


Vx -f 1 - Vx - 1 

9. Use the function of a function rule to find the derivatfves of 


Va*-x*; Vo*+x* and 

T a* +x* 

Deduce the last derivative from the other two by the quotient rule. 
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10. From the function of a function rule find the derivatives of 

(-S' - 

Multiply out these powers and deduce the derivatives by using the sum, pro« 
duct and quotient rules only. 

11. Find the derivative of each of the general expressions : 

ax*+L+cS and 

12. If J(x) is a single-valued function of x, express the derivatives of V f(x) 
and its reciprocal in terms of the derivative of J(x). 

18s If X and y satisfy the relation xy =a, show that 
dy y . dx X 
dx X dy y 

so verifying the inverse function rule in this case. Consider the relation 
=a in the same way. 

d f dy\ 

14. If j/ is a single-valued function of a?, show that -j-(x* ■\-y^) — 2\x ^-y-^ j . 

d\f x dx \ ctx/ 

Deduce that ^ if « and y satisfy the relation a;*+y* = l. Verify by 

y 

finding the derivative of t/ = Vl - a;*. 


15. Evaluate the second derivatives of — • and *^1 -a?*. 

^/a; 

1 vX •4'fi 

16. Find the first and second derivatives of r and of ? • Deduce 

ox+6 ox+o 

that successive derivatives of these functions differ only by a constant factor. 

d^y dy 

17. If aw = a, show that + 2-^ =0. Show that the same result holds 

for xy=a+bx. ^ 

18. If y=*’+i, show that + -4y =0. Does the same result 

X* ^ ctx ctx 

hold for the relation y =ax* 

X* 

19. Show that all derivatives of ox* -i- l>x + c of higher order them the second 
are zero. 

20. If S(x) is a single-valued function of x, obtain the third and fourth 
derivatives of x/(x) in terms of the derivatives of/(x). In general, deduce that 

and verify by Leibniz’s Theorem. 

21. Draw graphs of y =x* and y —Hx for a range of positive and negative 
values of x and verify that each function is monotonic increasing. How ia one 
graph to be obtained from the other ? Find the derivative of f/x and check that 
the tangent gradient to the curve y =4^x, as estimated from the graph, equals 
£iii8 derivative. Show that the derivative becomes infini te at x=0 and so 
that the tangent to the curve is Oy at this point. 



177 


THE TECHNIQUE OP DERIVATION 


22, Write *•, oj* and as products and deduce their derivatives from the 
derivative of x by the product rule. From the derivatives of x* and a?*, use 

the quotient rule to obtain the derivatives of ~ and and the inverse 

X X 

function rule to obtain the derivatives of s/x and f/x. Find the derivatives 
of x^ and by the function of a function rule from the derivative of V®* 


23. Generalise the results of the previous example to show how the standard 
form =nx^^ can bo deduced, for any rational value of n, from the rules 

of derivation and from the fact that ^(a?) = 1. 

CLX 


24. Find the derivative of (a - bx)x*. If a hemispherical bowl of radius 
10 inches is filled to a depth of x inches with water, it is known that the volume 
of water is ir(10 -aj)a;* cubic inches. Find the rate of increase of volume as 
the depth increases and an approximate value of the volume incre€ise when 
the depth increases from 5 to 6*1 inches. 


25. If water is poured into the bowl of the previous example at the rate of 

one cubic inch per second, show that the depth increases by — — — inches 
per second when the depth is x inches. wx(20 - 3a?) 


28. Find the equation of the tangent to the curve y =aa? + 6 + - at the point 
with abscissa ajj. Where is the teuigent parallel to Oa:T * 


27. A projectile travels <(1 +^*) feet in t seconds. Show that it moves with 
increasing velocity and acceleration. What is the velocity after 10 seconds? 
Find an approximate value of the distance travelled during a small time At 
from the tth second. 


28. After t seconds a body is a? =oi + iht* feet from its starting-point. What 
are the velocity and acceleration then? Show that the velocity ia n/o* + 26x 
feet per second when x feet have been covered. 


29# If the demand law is — c, show that total revenue decreases as 

x 

output increases, margmal revenue being a negative constant. 

30. In the case of the demand law p = (o - 6a?)*, show that the average and 
marginal revenue curves axe both parabolas, the former lying above the latter. 
Show that marginal revenue falls to a negative value and then rises to zero as 
output increases. Plot the total, average and marginal revenue curves in the 
case of the demand of Examples V, 10. 

31. Show that the demand law p=a-6a?* gives parabolic average and 
marginal revenue curves of similar shape, both falling continuously as output 
increases. Where is marginal revenue zero? 

32. What is the marginal revenue function for the demand p = N/a-6a?? 
Under the conditions of Examples V, 8, how many tenements must be rented 
per week before marginal revenue falls to zero ? 

0 * dp 

33* Show that marginal revenue can always be expressed pa 

Deduce that the gradient of the demand curve is numerically equal to - at the 

output where marginal revenue is zero. Check this resvdt in the cases of the 
particular demand laws of the two previous examples. 
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The total cost function is U = >Jax + h -he. Find an expression for 
marginal cost and show that it decreases as output increases. Plot the mar- 
ginal cost curve for the electricity output of Examples V, 17. 


35. If n=ax^-^^-hd is the total cost of an output x, show that the mar- 

ginal cost of the ' output is a 1 + • If h >c, deduce that marginal 

cost falls continuously as output increases. Draw the marginal cost curve 
of the tobacco manufacturer of Excunples V, 20. 


/' 86. If a firm can produce two chocolate “ lines ** according to the conditions 

of Examples V, 26, interpret the value of the derivative ^ . At what output 

is a small decrease in the production of one line accompanied by an equal 
increase in the other line T 


37. The output of a coal-mine is given in terms of tne number of men 
working per shift according to the relation of Examples V, 27. Draw a graph 
showing variation of output. Find expressions for average and marginal 
output when x men per shift are worked and draw the corresponding curves. 
Show that both curves are parabolas and that average output only equals 
marginal output when the former has its greatest value. 



CHAPTER Vni , 

APPLICATIONS OF DERIVATIVES 


8.1 The sign and magnitude of the derivative. 

We have seen that the value of a derivative can be interpreted in 
two ways. The derivative of a single-valued function f(x) measures 
the rate of change of the function and the tangent gradient of the 
curve y=f{x) at the point in question. In the present chapter we 
base certain very important appUcations of the derivative on these 
interpretations of its value. It will be noticed, however, that our 
results depend largely upon diagrammatic “ intuition ” and are not 
rigorously established. The less strict development is sufficient at 
this stage, but it can be added that it is possible to provide formal 
analytical proofs of our results in all cases.* 

The value of the derivative off{x) at the point x=a is/'(a). The 
meaning of the sign of /'(a) is evident. If /'(a) is positive, then the 
rate of change of f{x) is positive, i.e. f{x) increases as x increases 
through the value x=a. The tangent gradient of the curve y—f[x) 
is positive and the tangent and curve slope upwards from left to 
right at the point with abscissa a. The converse properties hold if 
/'(a) is negative. Hence, 

(1) f (®) > ^ implies that f{x) increases as x increases and that the 

curve y=f(x) rises from left to right at the point x=a, 

(2) /'(a)<0 implies that /(a;) decreases as x increases and that 

the curve y=f{x) falls from left to right at the point x=a. 
The numerical magnitude of the derivative f\a) then measures how 
rapidly the function /(a:) increases or decreases, and how steeply the 
curve y=j{x) rises or falls, at the point x=a. 

The results can be extended to indicate the nature of the function 
or curve over a whole range of values of the variable x. In order^to 

* Some of the analytical proofs are given at a later stage, when we consider 
the iii^portant result known as Taylor’s series. 
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866 wh6n th6 function increaaes or decreases, or whero tho curve 
rises or falls, it is only necessary to examine the sign of the derivative 
of the function. For all ranges of values of a; in which f{x) is positive, 
we know that /(a:) increases continuously as x increases and that the 
curve y=f(x) rises continuously from left to right, and conversely 
for ranges in which f'{x) is negative. 

One case of special interest now presents itself, the case where the 
derivative is zero at a given point. If /'(a) = 0, then f(x) is neither 
increasing nor decreasing, and the curve y=f(x) is neither rising nor 
falling, at the point x=a. The value of the function is momentarily 
stationary and the curve has a tangent parallel to Ox. The value 
of the function at such a point is called a stationary value and much 
of the following development is concerned with these valueB. Before 
proceeding, it is instructive to examine some particular functions and 
to obtain a general idea of the properties holding at their stationary 
values. 

Ex. 1. 

Here ^=4-2a;« -2(a:-2). 

The derivative is positive when a: <2, zero when a? —2 and negative 
when a:> 2. The value of the function increases at first, becomes station- 
ary at a; — 2 and then decreases as x 
increases beyond this value. The 
corresponding curve is a parabola 
rising to a peak at the point where 
a; =2, as shown in Fig. 47. In this 
case, we say that the function has a 
‘‘maximum** value y=4 at the point 
where a; =» 2. 

Ex. 2. i^=a;*-4a; + 8. 

Here ^ = 2a;~4 = 2(a;-2). 

The sign of the derivative is exactly 
opposite to that found in the previous case. The value of th^ function is 
again stationary at a;B2, but the function decreases as x increases up to 
the value 2 and increases as x increases beyond 2. The curve, graphed in 
Fig. 48, is a parabola with a lowest point at a; b 2. In this case, we say 
that the function has a “ minimum ’* value y *==4at the point where a; «2. 
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Ex. 3. y = 

Here ~ =6 -6a; + fa:*=f (a; ~2)*. 

The derivative is positive at all points except that it becomes zero at 
the single point a; = 2. The function is mono tonic increasing but has a 
stationary value y = 4 at a; = 2. The graph of the curve is shown in Fig. 49, 




from which it is seen that the curve always rises except at the single point 
where the tangent is parallel to the axis Ox, The stationary value, in this 
case, is neither a “ maximum ** nor a ‘‘ minimum The point concerned 
is an example of what is called a “ point of inflexion a name derived 
from the fact that the curve crosses over the (horizontal) tangent as we 
pass through the point. 

8.2 Maximum and minimum values. 

It is now necessary to make more precise the meaning of the terms 
maximum and minimum values, which, we have seen., are connected 
with the stationary values of a function. 

Definition ; The function f{x) has a maximum (minimum) value 
at a point where the value off{x) is greater (less) than aU values in the 
immediate neighbourhood of the point. • 

The maximum and minimum values together can be termed the 
extreme values of the function. • 

It is assumed, in the following development, that the function and 
its derivative are jSnite and continuous at all points. The coAe- 
sponding curve is th^n smooth, being free of discontinuities and 
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“ sharp points Fig. 60 shows a hypothetical, and rather artificial, 
curve of this nature, drawn for the purpose of indicating the various 
possibilities that can arise. 

It is clear, in the first place, that a maximum or minimum value 
o{f(x) can occur only at a stationary point where the curve y=f{x) 
has a horizontal tangent. If the tangent slopes upwards at any 
point, there are larger values of f{x) immediately to the right of the 
point. If the tangent slopes downwards, there- are larger values of 
f{x) immediately to the left of the point. Neither of these cases is 



possible, by definition, at a point where f{x) is a maximum. The 
tangent can, therefore, only be horizontal at a maximum point 
and (similarly) at a minimum point. Thus all the maximum and 
minimum values of a function are iricluded amongst the stationary 
v-alues. 

In the second place, of all the points where the tangent to the 
curve y=f(x) is horizontal, there are some points (such as A and D) 
giving maximum values and other pointSs (such as B and E) giving 
minimum values of f{x). But there remctins the possibility of a 
third kind of point, such as the “ inflexional point C, where the 
function has neither a maximum nor a minimum value. Stationary 
values, therefore; include cases other than maximum and minimum 
values. 

A method for distinguishing diSerent kinds of stationary values is 
suggested by Fig. 50.‘ The tangent gra^ent pf the curve y=f(x) is 
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positive (the tangent being upward sloping) immediately to the left 
and negative (the tangent being downward sloping) immediately to 
the right of a point where f{x) has a maximum value. Hence, the 
derivative f'{x) changes from positive, through zero; to negative 
values as x increases through a value giving a maximum of f(x). At 
a point where the function has a minimum value, the tangent 
gradient and derivative change sign in the opposite sense. Finally, 
at a point where there is a stationary value which is neither maxi' 
mum nor minimum, the zero value of the tangent gradient and 
derivative does not mark a change in their sign at all ; they have 
the same sign on each side of the point. 

These results, indicated by inspection of the curve of Pig. 50, can 
be set out in precise analytical form': 


CRITERION FOR MAXIMUM AND MINIMUM VALUES 


(1) All maximum and minimum values of the single-valued 

function f(x) are stationary values and occur where /'(a:) is' 
zero. 

(2) If f'{x) changes in sign from^positive to negative as x increases 

through a value a where /'(h) = 0, then /(a) is a maxin^um 
value of the function / (a:). j '5 

If f\x) changes sign from negative to positive under 
the same conditions, then /(a) is a minimum value of the 
function. 

If f'(x) does not change sign under the same conditions, then 
f(a) is neither a maximum nor a minimum value of the 
function. 

There are thus two conditions to consider. The first is a “ neces- 


sary ” condition for an extreme value, while the second adds the 
“sufficient ’’ condition which enables us to distinguish maximum, 
minimum and other stationary values. The conditions together make 
up “necessary and sufficient’’ conditions (see 8.9 below). We can 
notice an ''alternative, but less exact, way of expressing the necessary 
condition. If y=f[x) has a stationary value (e.^ a maximum or 
minimum value) at a point, then y is momentarily steady in value 
and = 0 approximately for any small change in x from the point 
Thus when treating thf function at a maximum or minimum position, 
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we can regard it as a constant for small variations about the position. 
In any case, the value of the function always changes very slowly 
from such a position.* 

It must be emphasised that a maximum value of a frinction is 
not necessarily the ‘‘ greatest ’’ value of the function. A maximum 
value is greater than all neighbouring values but there may be still 

greater values of the ftmction at 
more remote points. Similarly, a 
minimum value need not be the 
“ smallest ’’ value. There is only 
one greatest and only one least 
value of the function but there 
may be several alternating maxi- 
mum and minimum values. This 
is clear from Fig. 50. Further, it 
is important to realise that every 
stationary value is not an extreme 
value and we should always be 
on the look-out for “inflexionar* 
stationary values. Finally, our criterion may break down and fail 
to show a maximum or minimum value if the function or its 
derivative is not continuous. Fig. 61 illustrates the fact that a 
maximum value, occurring at a “sharp point” of the curve, may 
not be indicated by the criterion we have given. 

8.3 Applications of the second derivative. 

The second derivative /"(a;) of a function, being the derivative of 
the first derivative f\x), measures the rate of increase or decrease 
of /'(a:), i.e. the rate of increase or decrease of the tangent gradient 
to the curve y =/(a;) as we pass through the point concerned. The 
sign of f'\x) at any point x^a provides some useful information. 
If /"(a) is positive, then /(a?) is changing at an increasing rate as 
X increases through a and the tangent gradient to the curve y=zf{x) 
increases as we pass through the point with abscissa a. Tjtie tangent 
to the curve turns in the anticlockwise direction and the curve is 
convex when viewed from below at this point. Conversely, if /"(a) 

* The device of keeping a maximised or minimised variable constant is 
often adopted by Wicksell. See, for example, Lectures on Political Economy 
(Ed. Bobbins, 1934), Vol. I, p. 181. t 
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is negative, then /(a;) changes at a decreasing rate as x increases, the 
tangent to the curve turns in the clockwise direction and the curve 
is concave from below at the point where x=a. These results, which 
are fully illustrated in Fig. 52, are independent of the value of the 
derivative /'(a), of whether the tangent to the curve slopes upwards, 
downwards or is horizontal at the point where x=a* Hence : 

(1) f*'(a)> 0 implies that the function f(x) changes at an increasing 

rate as x increases through the value a and that the curve 
y=zf[x) is convex from below at the point x—a, 

(2) /"(a) <0 implies that the function f{x) changes at a decreasing 

rate as x increases through the value a and that the curve 
y=f(x) is concave from below at the point x=a. 
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The numerical magnitude of /"(«) then indicates how rapidly the 
value of f{x) is “ accelerating and how great is the curvature of 
the curve y=^f{x) at the point a:=a. AU questions relating to the 
nature and extent of the curvature of a curve are answered by an 

♦ But the results, which are obvious enough when / (x) increases and the 
tangent slopes upwards, should be interpreted with care when f{x) decreases 
and the tangent slopes downwards. When we say that /'(a)< 0 and f"(a) >0 
implies that f(x) is decreasing and changing at an increasing rate at a; = a, we 
mean that the numerical rate of decrease of/(a?) is getting less (and not greater) 
as X increases. A negative quantity, which is getting less in numerical magni- 
tude, is increasing. 
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examination of the value of the second derivative of the function 
concerned. 

An alternative criterion for extreme values of a function can be 
given in terms of the second derivative. It is assumed that the 
function f(x) is finite, continuous and possesses continuous first and 
second derivatives. If /'(a) is zero and /"(a) negative, then f'{x) 
must decrease through zero as x increases through a, i.e. the change 
in sign of f'{x) is from positive to negative and x==a must give a 
maximum value of the function. In the same way, we see that a 
minimum value of the fimction is obtained at a;=a when /'(a) is 
zero and f"(a) positive. Hence : 


ALTERNATIVE CRITERION FOR MAXIMUM AND MINIMUM 

VALUES 



All maximum and minimum values of f{x) occur where 

r{x)=o. 


(2) If /'(a) = 0 and /"(a) <0, then f{a) is a maximum value of the 
function. If /'(a) = 0 and /"(a)> 0, then f{a) is a minimum 
value of the function. 


The first condition is, as before, a ** necessary condition. The 
second is a sufficient ” condition indicating situations in which only 
a maximum (or only a minimum) value off{x) can, occur. But it is 
not complete, i.e. both “ necessary and sufficient ’’ (see 8.9 below), 
since the case where /"(a) is zero is not covered. This case includes 
all inflexional stationary values and may include other maximum or 
minimum values as well. The alternative criterion, though often 
more useful, is somewhat more limited in its scope than the first 
criterion. 


8.4 Practical methods of finding maximum and minimum values. 

In practical problems, we require the maximum or minimum 
values attained by one variable (y) as the value of another variable 
(x) is changed. If the variables are related by a single-valued 
function y=f{xy) we proceed : 

(1) The derivative /'(a;) is obtained. 

'(2) The equation /'(a;)=0 is solved to give a number of solutions 
a;=a, «=&, x=c, ... . I ^ 
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(3) Each solution (e.g. a;=a) is taken in turn and examined. 

Either: the change in sign of /'(a-fA) as h changes jfrom 
negative to positive is determined. If the change is from 
positive to negative, f{a)iaa, maximum value of /(as). If the 
change is from negative to positive, f(a) is a minimum value 
of f{x). If there is no change of sign, /(a) is an inflexional 
value off{x). 

Or : /"(a) is obtained and its sign determined. If /"(a) is 
negative, /(a) is a maximum value of f{x). If /"(a) is 
positive, f(a) is a minimum value of /(a;). 

If this process is carried out in the cases of the three simple 
functions quoted in 8.1 above, it is found that one stationary value 
y = 4 occurs at a; = 2 in each case. This is a maximum value, a mini- 
mum value and an inflexional value in the three cases respectively. 
The following examples provide further illustrations. 

Ex. 1. y =a:*--3x*+6. 

Here ^ = 3a:»-6a; = 3a;(a:-2) ; ^ = 6a:-6=6(a:-l). 

The stationary values of y occur where Z^{x - 2) =0, i.e. at a; = 0 and at 
a; = 2. The corresponding values of y are y = 5 and y = 1. To distin^ish 
between maximum and minimum values, 

*^** = 2. S = 6>0. 

The function has a maximum value 5 at a;=0 and a minimum value 
1 at a; = 2. The graph of Fig. 56 below illustrates. 

Ex. 2. y^x^- 4tx^ + 16a;. 

Here ^ = 4a;»-12a:* + 16 ^ = 12a;*-24iE 

ax dx^ 

=4(a; - 2)2(a; + 1). = 12x(a; - 2). 

There are stationary values y = -11 ata; = -1 and y = 16 at a; = 2. At 
a;= ~ 1, the value of the second derivative is 36. The function thus has 
a minimum value of -11 at this point. At x = 2, the second derivative 
is zero ana the nature of the stationary value here remains undecided. 
To settle this point, the first derivative at a; = 2 + A has value 44* (A +3). 
This is positive for all small (positive or negative) values of A. “[nie 
function has an inflexional stationary value at a?=*2. The graph of 
Fig. 57 below illustrate% 



188 


MATHEMATICAL ANALYSIS FOR ECONOMISTS 


Ex. 3. 


2a; + l 


Rere - 


dx 


(x-ir 

which is negative for all values of x. The function has no stationary, 
and so no maximum or minimum, values. The curve representing this 
function is a rectangular hyperbola with asymptotes parallel to the axes, 
a curve which clearly has no tangent parallel to Ox, 


Ex. 4. y = 


2 -X 


Here ^ 


dx 


x^-^x 

x{x 


-2 

-4) 


(a;2 + a;-2)2 

There are stationary values y«--lata:=0 and y = at a; 
now examine the sign of the derivative near these points : 

dy A(A-4) dy_ ^(^ + 4) 

dx 


= 4. We 


_ - dy ^ 

’ d^~ (A* +9A + 18)* 


at a: =4+ A. 


(A2-fA-2)2 

The first expression changes from positive to negative as A is given 

small values changing from 
negative to positive. The 
second expression changes in 
the opposite sense as A is varied 
similarly. The function thus 
has a maximum value at 
a;=0 and a minimum value 
- J at a: =4. 

The curious feature of this 
case is that the maximum 
value of the function is smaller 
than the minimum value. This 
apparently paradoxical result 
is due to the fact that the 
function has infinite values, at 
a: = 1 and at a: = - 2. (At each 
of these values the denomin- 
ator of y is zero.) The graph of the function, shown in Fig. 53, illustrates 
how the presence of infinities influences the maximum and minimum 
values. ' ® 



Ex. 5. y 

i 

Here 


=aa:*+6a;+c. 


^^2ax'\‘b\ ^»2a. 
dx dofl 
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There is one stationary value j - — at This value is 

Jlid 

a maximum if a is negative and a minimum if a is positive. The curve 
corresponding to this function is a parabola with axis vertical. The 
stationary value occurs at the vertex of the curve and is a maximum or 
minimum according to the direction of the axis as determined by the sign 
of a (see 3.4 above). 


Ex. 6. where o, h and c are positive constants and x is 


assumed to take only positive values. 


Here 


dy _ c 
dx ^ x^' 


d^y _2c 
dx^ 


Since x is positive, there is only one stationary value and this occurs 


at the point x 


-4 


The second derivative is positive. So, the function 


has a minimum value, y ^2jac+h at this point. 


Ex. 7. y -6a;+c+- , where a, 6, c and d are positive constants 


and X is restricted to positive values. 
Here 




The stationary values occur at points where the derivative is zero, i.e. 
where x satisties the relation : 


2ax^ “d=0. 


It can be shown that this cubic equation has only one positive root, and 

that the value of this root is greater than ^ = ^ * The second derivative 

is seen to be positive. The function has thus a single minimum valve for 
the range of positive values of x. 


Ex. 8. An open box consists of a square base with vertical sides and 
has a volume of 4 cubic feet. What are the dimensions of the box for a 
minimum surface area ? 

Let X feet be the side of the square base and h feet the depth of the 
box. Then &c*=4 since the volume is given as 4 cubic feet. From this 
we obtain h in terms of oj. The surface area is 

y =|a:* + 4Aa: =a;* square feet. 
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So, y is written as a function of x only and its minimum value is 
required for variation in x (which alone governs the dimensions of the 
box). The values of x for stationary values of y are given by 

1-2. -^-0. U.:^-8. 

Hence x^2 gives the only stationary value of y. Since 

^ = 2+~=6>0, when a; = 2, 
dx^ 

the stationary value is a minimum. The minimum surface area is thus 
12 square feet and the dimensions of the box are then : side of base = 2 feet 
and depth of box = 1 foot. The depth of the box is half the side of the 
square base. 


8.5 A general problem of average and marginal values. 

From a given single-valued function f(x) at a point a:, we derive 
the pair of values : 

f (x) 

Average value of f(x) • Marginal value of f{x) =f\x). 

X 


The average value is taken over the whole range from zero to the 
given value x ; the marginal value refers to the “ margin at the 
given value x. In diagrammatic terms, if P is the point with 
abscissa x on the curve y=fix), then the average value of f{x) is 
represented by the gradient of the “ radius vector OP and the 
marginal value by the gradient of the tangent at P. The application 
of these general concepts in the economic problems of demand and 
cost have been considered at an earlier stage (6.8 above). 

It is now assumed, for convenience, that both x and /(a:) take only 
positive values. It is required to find the values of x which corre- 


spond to maximum or minimum values of the average 


m. 

X 


So 


d Ifix)] xf'{x)-f{x) 

dx\ X j a:* ’ 

d* (/(x)] _ x^f"{x) - 2xf’{x) + 2f(x) 
da:*\ X } X* * * 

• . ‘ fix) 

Stationary values of occur where xf'ix) -f(x)=0, i.e. where 

X 


fix) 


=/'(*). 


z 


I 
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At such a point, the value of the second derivative reduces to 

X J X 

and, since x is taken as positive, this has the sign of /"(a?). The 
average value of f{x) is thus stationary at any point where the 
average and marginal values oif(x) are equal. The stationary value 
is a maximum if f”[x) is negative at the point ; it is a minimum if 
f\x) is positive at the point. 

In diagrammatic terms, a stationary value of the average value 
of f(x) is shown by a point P on the curve y=f(x) where the tangent 
coincides with the radius vector OP, i.e. at a point where the tangent 




Fio. 64. 


passes through the origin 0. The average value is a maximum if the 
curve is concave from below at the point P ; it is a minimum if the 
curve is convex from below at the point. These facts are clear fix)m 
the two cases illustrated in Fig. 54. 

8.6 Points of inflexion. 

A single-valued function y =f{x) is defined to have an inflexional 

value at a point where the corresponding curve crosses from one side 

of its tangent to the other. The point itself is described as a point 

of inflexion. The most important property of a point of inflexion is 

that it marks a change in curvature, the curve changing jfrom convex 

to concave from below as we pass from left to right through the 

point, or conversely.* This property is clear fram the inflexional 

• 

* If the funotion is single-valued and the curve smooth, then all changes of 
curvature occur at points of inflexion. For multi-valued functions, on Jbhe 
other hand, a change in curvature may occur where the curve “ turns back 
on itself ” and the tangei^t is vertical. Such a point is not a point of inflexion. 
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cases shown in Fig. 66. There are two classes of points of inflexion. 
A point of one class (as illustrated in the first two cases of Fig. 66) 
marks a change of cirrvature from convex to concave from below 8ts 
we move from left to right along the curve. A point of the other 
class marks a change of curvature in the opposite sense (as shown in 
the second two cases of Fig. 55). The actual tangent at the point of 
inflexion is not restricted in any way ; it can slope upwards or down- 
wards with any numerical gradient whatever. Upward and down- 
ward sloping tangents are shown in Fig. 65 for each of the two classes 
of inflexional points. Further, as limiting cases, the tangent can be 
parallel to Ox with zero gradient or parallel to Oy with infinite gradient. 



Fio. 55. 

In addition to the change of curvature property, another charac- 
teristic of points of mflexion is evident from Fig. 65. A point of 
inflexion always corresponds to an extreme value of the tangent 
gradient of the curve. At a point of inflexion of the firsh class, the 
tangent 'gradient's a maximum, £he gradient increasing as we move 
from the left towards the point and decreasing as we move to the right 
away from the point. At a point of inflexion of the second class, it is 
seen, in the same way, that the tangent gradient is a minimum. 
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Assuming that the single-valued function f(x) is finite and con- 
tinuous with continuous first and second derivatives, it is a simple 
matter to express the properties of an inflexional value of f{x) in 
analytical terms. The derivative /'(a;), as the measure of the tangent 
gradient, must have an extreme value at any point of inflexion. It 
is necessary, therefore, that the second derivative f^'(x), being the 
derivative of f'{x), has zero value at the point. Further, the value 
of f"{x) must change in sign as x increases through the point con- 
cerned, the direction of the change determining the class to which 
the point of inflexion belongs. If f"(x) changes firom positive to 
negative, the derivative f'{x) has a maximum value and the curva- 
ture changes from convex to concave from below, i.e. the point of 
inflexion is of the flrst class. The converse change of sign of f"{x) 
indicates a point of inflexion of the other class. So 

, CRITERION FOR POINTS OF INFLEXION 

(1) An inflexional value of the function f{x) can only occur at a 

point where / "(a;) = 0. 

(2) lf/"(a) = 0, and if /"(a;) changes in sign as x increases through 

the value a, then f{a) is an inflexional value of the function 
f{x). The direction of the change of sign of /"(x) indicates 
the class of the point of inflexion. 

The first condition is necessary ” for points of inflexion. The 
second adds the sufficient ” condition and, altogether, we have a 
criterion which is complete, i.e. “ necessary and sufficient (see 
8.9 below). 

If the function is assumed to have a continuous third derivative, 
an alternative form of the criterion can be given in which this 
derivative is used. If /'"(a) is negative at a point where /"(a) is 
zero, then/'(a;) is a maximum at the point a:=a, i.e. we have a point 
of inflexion of the first class. Similarly, if /"'(a) is positive at the 
point where /"(a) is zero, we have a point of inflexion of the second 
class. Hence, if/"(a) =0 and/'" (a) ^0, there is a point of inflexion 
at a;=a apd the sign of the non-zero third derivative indicates to 
which class the infle^onal point belongs. This alternative* form of 
the criterion is not complete (not “ necessary and sufficient ") since 
the case where the third derivative is zero is not considered. 

A point of inflexion^ as we have remarked, is in no way dependent 
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on the value assumed by the first derivative of the function at the 
point. It may happen, however, that the first derivative is zero at 
the point of inflexion, the value of the function being stationary 
as well as inflexional. The second, and incomplete, criterion for 
stationary values (8.3 above) can now be extended slightly by the 
addition of the condition : 

If ^ fiP) ^ stationary and 

inflexional value of the function /(a:). 

The criterion is still incomplete since it takes no account of cases 
where the third derivative is zero.* 

As long as the third derivative of the function f{x) is not zero at 
the point a;=a, the following scheme indicates all the possible cases 
of stationary and inflexional values of f{x ) : 

Stationary value Inflexional value 

/ \ / \ 

Extreme Stationary and Inflexional and 

value inflexional value non-stationary value 

/'(a)=0 f"{a) #0 /'(a)=/"(a) = 0 /'(a) ^0 /"(a)=0 

Two examples will illustrate the method of locating inflexional 
sralues in the cases of particular functions : 

Ex. 1. y=a:*~3a;*+5. 

g-6(.-I); g-6. 

There is thus only one inflexional value of the function, i.e. ^»3 at 
X = 1. The third derivative is positive and the second derivative changes 
•sign from negative to positive as we pass from left to right through the 
point of inflexion. The point is thus an inflexion of the second class, the 
curvature of the curve changing from concave to convex and the tangent 
gradient ( - 3) being a minimum. Fig. 56 indicates the point of inflexion 
P and also the maximum and minimum points A and B on the curve 
representing this function. 

Ex. 2. y = + 16a;. 

’ ^ = 4(»^2)*(j: + 1); ^ = 12x(x-2); ^ = 24(*-1). 

t The criterion is fully completed in 17.6 below, where extreme and in- 
flexional values are determined by the first non-zero derivative of f(x) of 
whatever order it may be. 
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There are two points of inflexion. At rr =» 0 there is the inflexional yalue 
; at a: = 2 there is the inflexional value y = 16. At the first of these 
points the third derivative is negative. The point is an inflexion of the 
first class, the curvature changing from convex to concave and the tangent 



gradient (16) being a maximum. At the second of the points the third 
derivative is positive. The point is an inflexion of the second class, the 
curvature changing from concave to convex. Further, the first derivative 
is also zero at the point and we have here a stationary as well as an 



inflexional value of the function. The zero tangent gradient' at the point 
is a minimum value of the tangent gradient. Fig. 57 indicates the gr§.ph 
of the curve representing the function and shows the two inflexional 
points in addition to tUb single minimum point. 
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8.7 Monopoly problems in economic theory. 

Many of the problems of physics and other mathematical sciences 
can be reduced to the determination of positions of maximum or 
minimum. It is sufficient to instance the importance of maximum 
or minimum potential, energy, action and entropy. An exactly 
similar situation is found in certain branches of economic theory. In 
static problems, we find it convenient to assume that the individual 
consumer seeks the highest position on his “ preference scale ’’ 
consistent with market conditions, that the individual firm fixes 
output or price to produce the largest net revenue and organises its 
factors of production to give the largest output at a ^ven cost or the 
smallest cost for a given output. Similar assumed principles, in 
more complicated forms, are to be found in dynamic problems. If 
functions of a single variable suffice to interpret the phenomena 
concerned, then the methods of the present chapter apply at once in 
the solution of our problems. Some simple examples, intended to 
illustrate the method, are given in the following paragraphs.* 

As a first problem, suppose that a firm produces a good X under 
Known cost conditions represented by the total cost fimction /I=-F(a;). 
The demand of the firm’s market for the good X is assumed to be 
known and represented by the demand function a: = ^ (p) or p = ^ (a;), 
where x is the demand at price p. Within the limits set by this 
demand relation, the firm is assumed to act as a monopolist with the 
object of maximising net revenue. Two alternative points of view 
can be taken. Either : the firm fixes its output and leaves the price 
to be determined by the demand conditions. Or : the firm fixes its 
price and the demand conditions determine the appropriate output. 
The analysis of the problem is different in the two cases but, as we 
shall see, the results obtained are effectively identical. 

Assuming that the firm fixes output, then the price to clear any 
output X must be p = 0(a;) as given by the demand conditions. The 
gross revenue from output x is the total cost is n—F{x) 

and the.net revei^ue is (E- 17) given as a fimction of x. i'be output 

* For an account of the importance of maximum or mmimnm positions, 
pafticularly in relation to “ loose indefinite relations *’ between economic 
variables, see Edgeworth, McUhematical P9ychica (1881, reprinted 1932), pp. 
1>15 and pp. 83-93. ^ 
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X fixed by the firm for maximum net revenue must satisfy the two 
conditions : 

|(ii-J7)=0 and 


The first condition is 


At the output for 


dR dR dn 

dx dx ' ' * dx^ dx* 
equUibrium, therefore, marginal revenue must equal marginal cost. 
The second condition, for a maximum as opposed to a minimum, is 


A 

dx 



'~dx^ dx^^^^ 


d^R dm 
dx^ ^ dx^ 


Hence, at the equilibrium output, marginal revenue must be in- 
creasing less rapidly than marginal cost. This second condition is 



automatically satisfied if, for example, marginal revenue is decreas- 
ing while marginal cost is increasing as output increases from the 
value at which these marginal concepts are equal. 

The position can be represented on a diagram in two different 
ways. The total revenue and cost curves can be flrawn on*one dia- 
gram, taking output x along the horizontal axis and revenue or cost 
along the vertical axis. Suppose that, on the two curves, pointd in 
the same vertical linf can be found such that the tangents to the 
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respective curves are parallel 



The output common to 


such points is a monopoly equilibrium output provided only that the 
total revenue curve is less convex (or more concave) from below than 

the total cost curve \ . When the total revenue and cost 

\dx^ dx^ J 


curves are of “ normal form (see 10.7-8 below), the position is 
illustrated by the curves of Pig. 58. The net revenue obtainable 
from any output is shown by the vertical distance of the total revenue 
curve above the total cost curve. This is clearly a maximum in the 
position PQ^ the tangent to the total cost curve at P being parallel 
to the tangent to the total revenue curve at Q. The monopoly out- 
put OM is thus uniquely determined in this case and the monopoly 
price is read off as the gradient of 0^. 



The monopoly situation can also be represented on a diagram show- 
ing the average and marginal revenue and cost curves referred to the 
same axes and scales. Fig. 59 exhibits these four Qurves as obtained 
from the total curves of Fig. 58. Three of the curves happen to Le 
straight lines, but this is not an essential feature of the ‘4 nonnal ’’ 
case herS taken. •The unique monopoly output is now given as the 
abscissa OM of the point P where the marginal curves intersect. 
Thb monopoly price is MRy the average cost of the monopoly output 
is MQ and the maximised net revenue is Qft times OAf, i.e. the 
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rectangular area shaded. The condition for a maximum, as opposed 
to a minimum, is automatically satisfied here since the marginal 
revenue curve falls while the marginal cost curve rises.* 

The analytical solution of the monopoly problem can be illustrated 
by assuming particular cost and demand functions of simple type 
appropriate to “ normal ** conditions. If the cost function is quad- 
ratic in form 11 = ax^ -{-bx + c and if the demand function is the linear 
form p=p-oLX (all the constants specified being positive), then the 
first condition of equality of marginal cost and revenue is 

2ax + b~P -2ocx, 
p-b 


i.e 


x = - 


2 (a -h a) 

There is a unique equilibrium output, provided that j3> b. This last 
condition is simply that marginal revenue is greater than marginal 
cost at zero, or very small, output. If it is not satisfied, the firm 
never makes a positive profit and its losses are least when it produces 


nothing. The second condition for equilibrium 


dm \ 
^ dx^ ) 


IS 


always satisfied. Figs. 58 and 59 are drawn for cost and demand 
functions of these types. A sugar refinery produces an output of 
X tons per month at a total cost of £( 5 - 0 - 0 ;^ -h 15a; + 800) and mono- 
polises the sale of sugar on a market with a demand law p = 50 - ^x, 
where £p is the price (per ton) of sugar. The monopoly equilibrium 
output is just under 150 tons per month and the monopoly price is 
approximately £35 per ton or rather under 4d. per lb. 

Taking the second view of the monopoly problem, the firm fixes 
the price, and its output to meet the demand must be x = ^(p). 
Here 

Ji = xp=p<^(p) and IT=F(x) =F{(/>(p)}. 

The net revenue at the price p is (F- IT) and this is a maximum if 


/I 

and ^.iB-n)<0. 

• • 

• It is to be noticed that, os far as our formal solution is concerned, there is 
no reason to suppose that net revenue is positive even when it is maximised. 
The total cost curve may lie completely above the total revenue curve. *In 
this case, the firm either goes out of business or stays in and cuts its losses. 
Maximum net revenue isfthen minimum net loss. 
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The first condition is 

d f f . dTl _ 

5WW>-^=o. 

The equation to be solved for the- equilibrium price is thus 

<f>ip) + [p-^)4>'(p)=^- 

The limiting condition for maximum net revenue is 

# (J)) + (p - ^ #'(}))| < 0, 

Any value of p satisfying the above equation and inequality is a 
possible monopoly price and the corresponding output is x = ^(p). 

The second analysis of the monopoly problem, which is due to 
Cournot, can be shown to lead to the same equilibrium price and 
output as is obtained in the first analysis. This fact can easily be 
checked in particular cases, e.g. with a quadratic cost function and 
a linear demand function. Or, in general terms, the above equation 
for the monopoly price gives 

d^i d-R dx 

dTI _ ^ {p) +p<f>\p) _dp _ dx dp _ dR 
dx <f>\p) dx dx dx^ 

dp dp 

i.e. the condition of the first analysis that marginal cost and marginal 
revenue are equal is satisfied also in the condition of the second 
analysis. 

8.8 Problems of duopoly. 

The demand of a market for a good X is represented,«as before, 
by the 'demand l-elation p = iff{x) connecting the price p and the 
demand x. The production of the good is shared between two duo- 
polist firms selling at the same price p. The first duopolist produces 
an output 0 ^ at a total cost of TTi=F^{xi) an<J the second duopolist 


i.e. 


dp] 
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produces an output Xg at a total cost of n^—F^{x^. The solution 
of the problem of the distribution of the market between the two 
duopolists depends entirely upon what is assumed about the reaction 
of one duopoUst to any action on the part of the other. 

It is assumed, in the simplest duopoly problem, that each duo- 
polist expects the other to make no change in current output no 
matter what changes he makes in his own output. Subject to this 
expectation, each duopolist then aims at fixing his output for 
maximum net revenue. If outputs x^ and x^ are fixed by the two 
duopolists, the price of the good is determined by p = ^{x), where 
a: -f ajg is the total output. The net revenue of the first duopolist 
is (x^ - i 7 i) and, for this to be a maximum, x^ must be chosen so 
that 




i.e. 


d dill 


This is the familiar equality between marginal revenue and mar- 
ginal cost. The diflSculty here is to express marginal revenue in a 
suitable form. Since Xip^x^tp{x)y where x=Xi’hX2, we have 


^ (XiP) = tp{x)+Xi^tP{x) {Xi + X^)=tj, (x) + Xilf>'(x), 

making use of the assumption that the first duopolist considers X2 
as fixed. Hence, for any given output x^ of the second duopolist, 
the equation which determines the first duopolist’s output is 

dUi 


i//{x)-\-Xiilf\x) = 


dxi 


In the same way, given any output Xi of the first duopolist, the 
second duopolist fixes his output X2 so that 

»li(x)+Xt>l>'(x)=^. 

These two equations are together sufficient, in general, to determine 
the outputs of the two duopolists. The total output and the price 
at which it is sold then follow at once. 

The first»equation gives the output of the first duopolist in terms 
of whatever output the second duopolist is producifig, i.e. it gives Xi 
as a function of Xg. It can be taken, under “ normal ” conditions, 
that an increase in x^ results in a decrease in Xi of smaller amount. 
The dependence of x^ on x^ can be represented by a ‘‘ reaction 
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curve ** Cl in the plane 0x^X2. The ** normal form of Ci is shown 
in Fig. 60. The curve must be considered in relation to the axis Oxi 
and its gradient to this axis is negative but numerically less than 
unity. Similarly, the second equation gives a;, as a function of Xi 

and a second reaction curve C7, 
is obtained. This curve is 
related to the axis Oxi and, in 
the “ normal ’’ case, its gradient 
to this axis is negative and 
numerically less than unity. 
The two curves Ci and (7a thus 
intersect in a single point P and 
the co-ordinates of P give the 
equilibrium outputs Xi and a:, 
of the two duopolists. 

In the particular case where 
the duopolists have the same 
total cost function 11 =F{x), the equations giving their outputs are 
^{x)’\-Xiijj'(x) =zF'{xi) and ip{x) -\-X 2 if'{x)=F'{x 2 ). 

Th? reaction curve Ci when viewed from the axis Oa:a is now of 
exactly the same form as the reaction curve (7a viewed from the 
axis Oxi, It follows that Xi and X 2 must be equal at the point of 
intersection. Thus, a;S we expect, the total output is shared equally 
between the two duopolists : aj^ =a;a = The value of a; is given by 
^{x)^\x^ls\x)=^F\\x), 

Further, if each duopolist produces at constant total cost, the tota^ 
output shared equally between them is given by 

4,{x) + \x^'{x) = 0, 
i.e. t}i{x) + {^{x)+x>lt'{x)) = Q, 

i.e. ^(ar)+^{x0(x)} = O. 

The total output is such that the sum of the average and marginal 
revenue from the total demand p = ^(x) is equal to z^o. If the 
averagd and marginal revenue curves are as shown in Fig. 61, the 
total output under duopoly is given by ON. This output can be 
cchnpared easily with that under pure monopoly. The monopoly 
output of a single monopolist with constant^total cost is such that 
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marginal revenue is equal to marginal cost, i.e. is equal to zero. In 
Pig. 61, the monopoly output is OM, and this is less than the duopoly 
output ON, 

In conclusion, we can indicate the way in which the general 
duopoly problem can be analysed. It is assumed that the first 
duopolist, when he varies his own output expects the second duo- 
polist to react and vary his output x^ according to some definite law 



Fig. 61. 


X 2 —f{Xy), Thus, if he changes his output from a level x^^ he expcjcts 
his rival’s output to expand or contract at a rate indicated by the 
dx 

derivative Following Professor Frisch,* this derivative 

CiX-^ 

can be termed a “ conjectural variation ” and it may be positive or 
negative in value according to circumstances. For a maximum net 
revenue, the first duopolist’s marginal cost must equal 

^ (Xi p) = (a:) + a:i ^ ^ (x) (x, + x,) = ^ (x) + Xiiji'ix) (l + J*) ' 

The equation, which gives as a function of X 2 and determines the 
reaction curve Ci of the first duopolist, is now 


0(a;) + xif(x) ( 1 +^) 


dxi 


In the sau^e way, it is assumed that the second duopolist expects 
the output of the first duopolist to vary according 4o a defiilite law 


Xi=g(x 2 ). The derivative is again the conjectural varia- 

dx^ 


♦ Frisch, Monopole-Mypole, Nationalokonomisk Tidsskrift, 1933. 
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tion. The equation giving a;, as a function of and defining the 
reaction curve of the second duopolist is 

The two reaction curves, by their point or points of intersection, 
determine the duopoly distribution of output between the firms. 
Their forms depend on the nature of the conjectural variations of 
the two duopolists and, by making various assumptions about 
these conjectural variations, a whole series of duopoly problems can 
be defined and analysed. The problem will be elaborated at a later 
stage (13.9 below).* 

8.9 A note on necessary and sufficient conditions. 

In the above analysis we have had occasion to draw distinctions 
between ** necessary ** conditions, “ sufficient ’’ conditions and con- 
ditions which are ‘‘ necessary and sufficient The following 
observations serve to explain more fully the nature of these dis- 
tinctions and to illustrate their importance. 

To start with a simple example, we can examine conditions under 
which a four-sided figure is a rectangle. First, if the figure is a 
rectangle, then one of its angles must be a right angle. This is a 
necessary condition. While all recta ngles have a right angle for one 
angle, there are also other figures with the same property. If, how- 
ever, one angle of the figure is a right angle and aU sides are of the same 
length, then the figure must be a rectangle. This is a sufficient con- 
dition. All figures w ith th e property stated are rectangles, but there 
are some rectangles (i.e, those not squares) which do not display the 
property. The condition is thus not complete, not necessary and 
sufficient. Finally, if the figure is a rectangle, then one angle is a 
right angle and opposite sides are of equal lengths. Conversely, if a 
figure has one angle a right angle aif& opposite sides of equal lengths, 
then the figure is a rectangle. We have here a necessary and suffi- 
cient condition ; it is complete, including all rectangles and no figures 
other than rectapgles. * 

In general, a necessary condition for a certain property is such 

The analysis given above, in the case where the conjectural variations are 
zero, is based on the work of Cournot. For the general problem, see Hicks, 
The Theory of Monopoly, Econometrica, 1936. • 
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that, if the property holds, then the condition is satisfied. The con- 
dition is satisfied by all things with the property but may be satisfied 
also by things without the property. A sufficient condition is such 
that, if the condition is satisfied, then the property holds. The con- 
dition is satisfied by no thing without the property but may not be 
satisfied by some things with the property. A necessary and sufficient 
condition is one which holds in both the ways described — if the pro- 
perty holds, then the condition is satisfied; if the condition is 
satisfied, then the property holds. The condition is complete and 
includes all things with the property and no others. 

An important instance of the distinctions here drawn is provided 
by the conditions for maximum and minimum values of a function 
f{z) which has a finite and continuous derivative. A necessary con- 
dition for a maximum value at a ; =0 is that f'{a) = 0 . This condition 
is satisfied at all maximum positions but also at other positions (e.g. 
minimum or some inflexional positions). A sufficient condition for 
a maximum value is that / "(a) = 0 and /"(a) <0. We have a maxi- 
mum value whenever this condition is satisfied but some maximum 
values can occur even when the condition is not satisfied. A neces- 
sary and sufficient condition for a maximum value is that /'(a) = 0 
and that f'(x) changes sign from positive to negative as x increases 
through the value a. A maximum value must satisfy this condition 
and we have a maximum value whenever the condition is satisfied. 


EXAMPLES Vin 
General applications of derivatives 

1. Write down the derivative of 3a;* + 3x* + a; - 1 and show that this 

dy 

function is monotonic increasing. If y = oa;* + 6x* + ca; + d, express ^ as a 

square plus a constant term. Deduce that the function is monotonio if 
6*< 3ac and that it then increases or decreases according to the sign of o. 


2. Show, by means of derivatives, that y = ^ 


and t/ = 


1 - a; 


are both 


2a; + 1 1 + a; 

monotonio functions. Generalise by showing that t/ = -r- is alwa 3 rs a 

^ a^ + Of 

monotonic function. Wlien does it increase and when decrease? Illustrate 
these results by considering the shape of the rectangular hyperbolas which 
represent the function. 

Show that 2 / = a; + ~ has one maximum and one minimum value and tbiat 
the latter is larger than tlie former. Draw a graph to illustrate. 
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4. Show that y = 2a; - 1 + - has a single minimum value and is positive for 
all positive values of x, * 

6. Find the maximum and minimum values of y = a;* - 3a; - 1 and of 
y = 3a;* - 10a;* + 6a;* + 6. Illustrate graphically. 

(0 If y = a;* - 4a;* + 6a;* - 4a; - 3, show that — = 4 (a? - 1)*. Deduce, from 

the first criterion for maximum and minimum values, that y has a minimum 
value at »= 1. Why does the second criterion fail to give the result in this 
cc^eT 

1 _ aji 

7. Show that each of the functions y = and y = \^3 - a;* has a single 

A "1" a; 

maximum value. Draw rough graphs of the functions to illustrate. 

8. Show that y — x^l +a; has a minimum and y= -x^l-i-x a maximum 
value. Draw the graphs of the functions for a; > - 1 and deduce that the 
double- valued function y* = a;* ( 1 + ar) is continuous with a maximum and a 
minim mn at th e same value of a;. Then consider y = + Va;*(l +a;) and 
y = - •s/a;*(l + a;) as the two single- valued branches of this function, showing 
that each branch is continuous but without a derivative at a; = 0. Illustrate 
the difficvilty of defining derivatives for multi-valued fimctions. 

[Si Show that y* = a;(a;* - 1) can be divided into two single- valued branches, 
one^With a maximum and the other with a minimum value. Deduce that the 
curve representing the double-valued function is continuous, defined only for 
certain ranges of x and shows a vertical tangent at three points. 

Find the derivative of y^i/x* and show that it is infinite at a; = 0. 
Di^bkw a graph of the function and indicate its behaviour in the neighbourhood 
of the origin. Deduce that y has a minimum value at the origin which is not 
a stationary value. Contrast this function and its graph with y — f/x (see 
Excunples VII, 21). 

ft. Show that the perimeter of a rectangle of curea 16 square inches is least 
wheh the rectangle is a square of side 4 inches. 

a A rectangular area is to be marked off as a chicken run with one side 
along an existing wall. The other sides are marked by wire netting of which 
a given length is available. Show that the area of the nm is a maximum if 
one side is made twice the other, 

T3. A cricket field consists of a rectangle with a semicircular area at each 
end. The perimeter is to be used as a quarter-mile running track. Find the 
dimensions of the field so that the area of the rectangular portion is the largest 
possible. (Take 7r = 3-14169.) 

14. A tinned soup manufacturer uses tins which are circular cylmders 
closed top and bottom. Find the most economical dimensions of the tin 
(i.e. minimum surface area) when the volume is given. If the top and bottom 
of the tin are out from square sheets and the surplus wasted, find the new 
din^nsions for greatest economy. 

vyjL Express the ^jUstance of 0 from a point on the line 2a; -f y = 6 as a function 
of the aj-co-ordinate of the point. Find the point on the line nearest 0 and 
deduce that the shortest distance from O to the line is perpendicular to the 
line and of length i^6. Generalise to show that the shortest distance from 

. ^ ^ XI. T r rv • cu;i + 6yi-fc 

(ajj, yi) to the Ime oa; + 6y -i-c = 0is - - - X — • 

va*-f 6* • 
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Find, on the part of the rectangular hyperbola = 4 in the positive 
quadrant, the point which is nearest to O and show that the shortest distance 
is perpendicular to the tangent at this point. What is the shortest distance of 
the point (0, 2) from the parabola y = x*1 

17. Show that the curve y = 2x-Z^-^\a convex from below for all positive 
values of x. Is the same true of the curve t/ = aa; + 6 + — T 

X 

18. Show that the curve y = ax^ bx* + cx can have only one point of in- 
flexion. If a is positive, show that the curvature changes from concave to 
convex from below as we pass through the inflexional point from left to right. 
Deduce that the point of inflexion ia also a stationary point if 6* = Sac, 

2x 

19. Show that the curve !/ = ^, j has three points of inflexion separated 

by a maximum point and a minimum point. Verify these facts by drawing a 
graph of the curve. 

20. Prove that the curve y = x* has a single stationary point which is a point 
of inflexion. Are there any other points of inflexion ? 

21. Show that the curve y = ^x is convex from below for negative values 
of X and concave from below for positive values of x. Deduce that the origin 
is a point of inflexion. Why is this point not given by the criterion that the 
second derivative is zero 7 Check the result by considering the function as the 
inverse of y = x*. 

22. From the second derivative, verify that the rectangular hyperbola 
xy=li3 concave from below for negative values of x and convex from below 
for positive values of x. In what sense is x = 0 a point of inflexion 7 Contrast 
this case with that of the previous example. 

If /(x) is a single-valued function of x, find where {x/(x)} attains maxi- 
mum and minimum values and interpret in terms of the curve y—J(x), If 
X and/(x) are both positive, show that {x/(x)} can only be a maximum at a 
point where the curve y =/(x) is downw€u:d sloping with a curvature less than 
a certain amount. 


Economic applications of derivatives 

24. Show that the demand curves p = — ^-c and p = {a-bx)* are each 

downward sloping and convex from below. Do the same properties hold of 
the marginal revenue curves 7 Show further that, for each of the demand 
laws p = s/a - bx and p = a- 6x*, the demand and marginal revenue curves are 
downward sloping and concave from below. 

25. Show that the demand curve x = 4o* - Sap* + p*, where a is a positive 
constant, and p is less than 2a, is downward sloping with a ppint of inflexion. 
How does the, curvature change 7 

26. With the aid of derivatives, check the positions cf maximinn total 
revenue obtained in the cases of the demand laws of Examples V, 2, 10 £ind 11. 

^7.' It is given that a demand curve is convex from below at all 

points. Show that the marginal revenue curve is also convex from below 
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<i*p 3 

either if is positive or if is negative and numerically less than - . 

If the demand curve is always concave from below, does a similar property 
^old of the marginal revenue curve ? 

^ By examining the signs of certain derivatives, show that each of the 
cost functions 

/7=N/aa; + 6 + c and i7=aa;^-i-^ + d (^>c) 

$ x-\-c 


gives average and marginal cost curves which fall continuously with increasing 
output. 

29. From the sign of the second derivative, show that the transformation 
curve of Examples V, 26 is concave from below at all points. 

" 80. If the supply of a good is related to its price by the law x = as/p- 6 + c, 
where a, b and c are positive constants, show that the supply curve is upward 
sloping and concave to the axis Op at all points. Illustrate with the case of 
Ex^ples V, 23. 

An indifference map is defined by the relation {x + h)s/y + k = a, where 
h and k are fixed positive numbers and a is a positive parameter. By 
expressing i/ as a function of x and by finding derivatives, show that each 
indifference curve is downward sloping and convex from below. 

$2. If /I = ax* + 6aj + c is the cost function of a monopolist and if p = p- ax 
is t^e demand law, find the monopoly price and output when the monopolist 
is assumed to fix the price. Verify that this is the same result as when the 
monopolist fixes the output. 

834 A radio manufacturer produces x sets per week at a total cost of 
+ 3a;+100). He is a monopolist and the demand of his market is 
x=15- 2p, when the price is £p per set. Show that the maximum net revenue 
is obtained when about 30 sets are produced per week. What is the monopoly 
price? Illustrate by drawing an accurate graph. 

'"M, If the manufacturer of the previous example, with the same costs, 
pr^uces for a demand of x= 100 - 20 s/p sets per week, show that he should 
produce only 26 sets per week for maximum monopoly revenue. What is the 
monopoly price now? 

85. In the case of Example 33, a tax of £k per set is imposed by the gover- 
ment. The manufacturer adds ' the tax to his cost and determines the 
monopoly output and price under the new conditions. Show that the price 
increases by rather less than half the tax. Find the decrease in output and 
monopoly revenue in terms of k. Express ^he receipts from the tax in terms 
of k and determine the tax for maximuiA return. Show that the monopoly 
price increases by about 33 per cent, when this particular tax is imposed. 

86. Generalise the taxation problem of the previous example by finding the 
effects of a tax of k per unit of output when a monopolist’s total cost is 
n=ax* + 6aj -f c and the demand law is p = - oea?. Show that the tax brings 
in the maximum return when k~\(^-~b) and that the increase' in monopoly 
price ii always le#s than the tax. 

37. If a* monopolist has a total cost of n—ax* + bx-\-c and if the dem^d 
law ia ax*, show that the output for maximum revenue is 


/a* + 3a(B^b)^a 
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How many sets per week should t he rad io manufaoturer of Example 33 
produce when the demand is a? = 10 n/ 26 -p sets per week? 

38. The demand of a monopolist’s market is p = ~ ax and he produces an 
output of X units at a total cost of n=ax* - hx* + co; + d. Show that the output 
for maximum net revenue is the positive root of 

3aa;»-2(6-a)a?-(j3-c) = 0. 

Taking the cost conditions of the hrm of Examples V, 18, and the dememdlaw 
p = 60 ~ fa?, show that the firm must produce 3^ tons of its product per month 
for maximum monopoly revenue. What is the monopoly price per ton? 

A firm with a total cost function n=F{x) sells on a perfectly competi- 
tive market, the market price being fixed at p. Show that the output of the 
firm for maximum net revenue is such that marginal cost equals p, provided 
that total costs are covered. Deduce that there is a supply relation for the 
firm, giving the output as a function of the market price p. 

40* If F{x)=:ax*-^bx-\-c in the previous example, show that the supply 
relation is linear. Show that p must exceed 6 + 2‘Jac if total costs are to be 
covered but that, if only variable costs are to be covered, p need only exceed b. 
Illustrate the determination of the supply curve by drawing a diagram show- 
ing the average and marginal cost curves. 

41. A sugar refinery has total cost equal to -f 6aj -I- 200) when x tons 

of sugar are produced per week. The fixed market price is £p per ton. What 
is the supply ciurve of the firm? What is the lowest price to cover total costs? 
At what price will 160 tons be produced? 

42. A plant produces x tons of steel per week at a total cost of 

f - 3a;* -I- 60a; -f 300). 

If the market price is fixed at £33 6s. 8d. per ton, show that the plant produces 
16t tons per week. 

43. In the problem of the previous example, show that the plamt’s output 
at the fixed market price of £p per ton is the root of 

3a;* - 60a; -f 10(50 -p) = 0, 

which ia greater than 10 tons per week. What is the smallest price for total 
costs to be covered ? Show that the supply curve is 

a;=10-fiN/30p-600 

for values of p greater than this minimum amount. Connect the supply curve 
with the marginal cost curve and illustrate graphically. 

44. The market demand for a good is given by p = - aa;. The market is 

supplied by two duopolists with cost functions /Jj = o^a;!* + and 

17, = Ojo;,* -f 6^, -f- c,. Assuming that the “ conjectural variations ** are zero, 
show that the reaction ciirves cure straight lines. Deduce the equilibrium 
output of each duopohst. 

/ 45. The duopolists of the previous excunple are rcwiio manufacturers pro- 

ducing identical sets. The total cost of an output of x sets per week is 
^ (i?^* + 3aj -f* 100) in eaoh case. When the price is £p per set, the market 
demand is a; = 76 - 3p sets per week. Show that the total equilibrium output 
is approximately 41 sets per week. Compare with the monopoly output of 
Example 33. 

a M.A. 
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46 . If the market demand of the previous example is x— 10^26 -p sets 
per week, show that the reaction curve of the first duopolist is 

= J (>/»,* + 16a;, + 6616 - 2a?, - 4) 

and similarly for the second duopolist. Draw a graph showing the two re- 
action curves and deduce that the total output is now approximately 32 sets 
per week. Compare with the monopoly output of Example 37. 

A firm, with fixed plant, supplies of raw materials, etc., produces two 
go<^ X and Y in amounts related by the transformation function y^f(x). 
The market prices of the two goods are fixed at Pg^ and p^. Show that the 
outputs for maximum total revenue are such that - /'(a?) equals the ratio of 

4^ If the firm of the previous example monopolises the sale of both X 
ana Y, the demand curves for X and Y being given and independent, show 
that the outputs for maximum total revenue are such that equals the 

ratio of the marginal revenues from the X and Y demands. 

-^49. A steel plant is capable of producing x tons per day of a low grade steel 

40 ~ 5x 

and y tons per day of a high grade steel, where = * If the fixed 

market price of low grade steel is half that of high grade steel, show that 
about tons of low grade steel are produced per day for maximiim total 
revenue. 

50. The steel producer of the previous example monopolises the sale of both 
quality steels. If the prices of low and high grade steel are and £py per 
ton, the demands are pgg — 20-x and = 26 - 2y. Find an equation giving 
the output X of low grade steel for maximum total revenue. Show, by a 
graphic^ method, that just under 6 tons of this steel are produced per day. 
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EXPONENTIAL AND LOGARITHMIC FUNCTIONS 
9.1 Exponential functions. 

We have been concerned, so far, with power functions and with the 
variety of functions that can be derived, in one way or another, from 
power functions. Such functions are of great practical importance 
and they serve to describe, accurately or approximately, many of 
the ways in which one variable depends on another. It is now con- 
venient to extend the range of om* function concept by the definition 
of an entirely new function type. 

The power function is represented by the general form j/=x”, 
where n is any given number. A new function can be defined by the 
simple process of taking the base of the power as a fixed number and 
the index as variable. So, instead of writing a fixed power df a 
variable number, we write a variable power of a fixed number. The 
new function so obtained is called an exponential function and we 
write y _ 

where a is the fixed base of the function. Since a power can be 
defined for all values of the index, the exponential function is a 
function of a continuous variable x. 

It is found convenient, in general, to limit the value that can be 
allotted to the base a of an exponential function. If a is negative, 
then many values of a® (e.g. when a: is | or are not defined in 
terms of the real number system. To avoid this difficulty, we always 
take a as a positive number. If a is a positive fraction (between 0 

and 1) and if we write 6 = then 

a 



i.e. any power of a number less than unity can be reduced to a power 
of a number greater than unity, the sign of the index being reversed. 
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There can be no objection, therefore, if we take the base a as a 
number greaier than unity. This is assumed throughout the following 
development.* 

The graph of the exponential function can be plotted once the 
base is given a definite value. For the particular function y = 2», a 
table of values of x and y can be obtained : 


X 

... 

-2 

-i 

-1 


0 

i 

1 

i 

2 

... 

y 

... 

0-26 

0-36 

0-60 

0-71 

1 

141 

2 

2-83 

4 

... 


the value of y being taken, where necessary, to two decimal places. 
It is seen, from Fig. 62, that the points plotted from this table can 



be joined by a smooth line which is the curve representing the func- 
tion y=2*. The graph shown is constructed for selected rational 
values of x only. But, for an irrational value of a:, the value of 2® is 
defined as the limit of a set of rational powers and the corresponding 
point must fit into the graph in a continuous way. The function 
y = 2® is thus continuous. 

The graphs of other examples of the exponential function type 
y=a® Are of exactly similar shape. It will be seen later that the 
graph of one exponential function (e.g. y = 10*) can be obtained from 

♦ Notice that the case a= 1 is trivial. Tlxe exponential function is then the 
constant y == 1. 
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I that of any other (e.g. y = 2®) by a process of ‘‘ stretching ” or ‘‘ con- 
tracting ’’ in the direction of Ox. The general shape of the curve 
y=a® is always the same ; its steepness or gradient varies with the 
value allotted to a and that is all. 

We conclude that y =a* is a single-valued and continuous function. 
It is also seen that it is monotonic increasing, y increasing over the 
whole range of positive values as x increases from infinitely large 
negative to infinitely large positive values. It follows that, given 
any positive number p, we can find a unique value g so thatp=a«. 
An approximate value of g, for given values of a and p, can be read 
oflF the graph of the function y =a® as the abscissa corresponding to 
the ordinate p. We have, therefore, the important result that any 
positive number can be expressed as a power of a given number greater 
than unity. 

It can be shown that, in general, irrational values of g correspond 
to rational values of p, and conversely. To illustrate the nature of 
the correspondence between p and g, we can take the useful base 10 
and write, for example, 

34 = 10iwi* and 7240 = 10»»®»’, 

the indices being irrational numbers written correct to four decimal 
places. Again 

10*=2-1628 and 10*=31-6228, 

these numbers being irrational and given to four decimal places. 
The result we have just given leads us at once to the consideration 
of what are called “ logarithms 

9.2 Logarithms and their properties. 

The logarithm of a positive number p to the base a ( > 1) is defined 
as the index of that power of a which equals p. In symbols : 

Dbukition : If p=a«, then g=logaP. 

Since a logarithm is simply the index of the power to which the base 
must be raised to obtain the given number, it follov%, from what has 
been said above, that any positive number has a unique logarithm 
to a definite base greater than unity. On the other hand, a negative 
number has no logaritlpn to any such base. 
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The logarithms of practical work have the convenient number 10 
as their base. Since, to take two examples already quoted, 

34 = and 7240 = 10* 

follows that 

logio34 = 1-5315 and logio7240 = 3-8597. 

These logarithms are correct to four decimal places. The two nota- 
tions, one in power form and the other in logarithmic form, are 
simply two ways of saying exactly the same thing. 

The logarithms to the base 10 are called common logarithms and 
their values have been calculated and set out in tables of logarithms 
to four, five or more decimah places. Hence, to find the common 
logarithm of a given number, we have only to look up the number in 
the tables. The numbers 1-5315 and 3-8597 given above were, in fact, 
obtained in this way. No attempt is made here to give an account 
of the way in which common logarithms are used in practice to 
facilitate numerical work. There are many practical devices to 
remember and it is assumed that the technique is familiar or can be 
obtained by reference to a text-book on algebra. The general laws 
of logarithms, given below, provide the basis of the practical work. 

Returning to logarithms to any base a greater than unity, two 
particular cases are derived from a® = l and a^=a : 

loga 1=0 and log^a = 1. 

Three general laws are obeyed by aU logarithms i 

Law I. The logarithm of a product. 

The logarithm of a product of two numbers is the sum of the 
separate logarithms : 

log. (Pl ' Pi) = log.Pi + log.P». 

Law II. The logarithm of a quotient. 

The logarithm of a quotient of two numbers is the difference of 
the separate logarithms : 

J ) = log.Pi - log«p,. 

Law‘ III. The logarithm of a power. 

The logarithm of a power is the index times the logarithm pf the 
base of the power i 

loga(pV=n,logap. 
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The laws can be extended to the case of products or quotients of 
more than two numbers and they can be combined to give the 
logarithms of complicated products, quotients or powers. Since the 
logarithm of 1 is zero, we obtain a particular, and useful, case of 
Law II I 



Notice, also, that the powers of p to which Law III applies include 
fractional powers. For example, we can write 

loga n/^)* = logoiJ* = I \ogaP- 

Since logarithms are indices, the laws of logarithms are simply 
translations of the index laws. For example, since indices are added 
when powers are multiplied, we expect that the logarithm of a pro- 
duct is the sum of the separate logarithms. Formal proofe of the 
three logarithm laws, based on the index laws, are as follows : 

In Laws I and II, let 

logaJJi = and log^pa = 

SO that Pi = and p j = 

Then Pi . P 2 = 

and “ = — 

P2 

So Iog<,(pi .pa) =gi + ?a = log„Pi + log,p*, 

and logo ( - gg = log„ 2 >i - 

\P2J 

In Law III, let log^p =<7 so that p =a«. 

Then p" = {a^y^ = a’*®. 


i.e. loga (p”) =nq=n log^p. 

The use of logarithms, both in theory and practice, is evident from 
the three laws. Expressions involving sums or differences are not 
easily treated by the use of logarithms. The logarithm of a sum or 
difference is not reducible and it must be stressed, in particular, that 
loga(pi-fpr) is not equal to the sum of log^^pi and log^p,. On the 
other hand, by taking logarithms, an expression iiwolving products 
and quotients is much simplified. The use of logarithms here 
replaces multiplication and division by the simpler processes of 
addition and subtract^n. Further, logarithms are equally useful in 
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dealing with powers and provide, for example, the only simple means 
of solving an equation in which the variable appears as an index. As 
a very simple instance, consider the equation : 

32‘-4=0 or 2*=|. 

Here log (2*) = log (f), 

i.e. X log 2 =log 4 - log 3, 



The logarithms can be to any base. Taking the base as 10 and look- 
ing up the logarithms concerned (to four decimal places) in tables. 


0-6021 -0*4771 126 

X = = = 0-416, 

0*3010 301 


correct to three decimal places. Other instances of equations most 
readily solved with the aid of logarithms arise in problems of com- 
pound interest (see 9.6 below). 

Since the base of a logarithm can be chosen arbitrarily and for 
convenience, it is useful to have the following law which connects 
logarithms to one base with those to another base : 


Law IV. The change of base in a logarithm. 

The logarithm of a number to the base a is the logarithm of the 
number to the base b times the logarithm of b to the base a : 

log,p=log„61og»p. 

As a particular case, put p=a. Then, since logoa = l, we have 
log„61og>o = l, i.e. 

The change of base law can thus be written in two alternative ways : 
log.p = log. 6 logtP = . 

The formal proof of the law proceeds : 

Let log^p=2' so that p=b*. 

Then log^p =loga(6<') log^b^log^b log^p. , 

The change of base law shows that the logarithms of a set of 
numbers to one base are simply constant multiples of the logarithms 
of the same set of numbers to another base. To change the base of 
a whole set of logarithms is only a matter of multiplying each 
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logarithm by a constant. For example, logarithms to the base 2 
can be obtained from common logarithms : 

logaP = (logalO) logioP = (j^^) logioP- 

From tables of common logarithms, therefore, it is possible to con- 
struct tables giving logarithms to any base other than 10. 

9.3 Logarithmic functions. 

If the variable y is the value of the logarithm of the variable x to 
a given base a, we obtain the logarithmic function 

y=log«ar. 

The relation between x and y defined by the logarithmic function is 
not, however, a new one. The logarithmic function, in fact, is simply ' 
the inverse of the exponential function. This follows from the^ 
definition of a logarithm; if y—\oga^> then a:=a*^. Since the 
exponential function is single-valued, continuous and monotonic 
increasing (o>l), the logarithmic function, as its inverse, possesses 
exactly similar properties. 

For any definite value of a (>1), the graph of y = logaa; is the 
graph of the exponential function y=a® with the axes Ox and Oy 
transposed. Alternatively, 
the graph can be obtained 
directly from tables of log- 
arithms. The graph of 
y=\og^x 

is shown in Fig. 63. It can 
' be derived from the graph of 
t/ = 2*, shown in Fig. 62, by 
transposing the axes. 

Logarithms to one base 
are constant multiples of 
logarithms to another base. 

It follows that the graph of 
y=\ogi,x can be derived 
froiu that of y=^\og^x by multiplying all ordinates by a constant 
i amoimt, i.e. by “ stretching ” or “ contracting ” the graph in the 
direction of Oy, All logarithmic functions have the same general 
shape when represented graphically ; the steepness of the graph 
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varies with the value given to the base but the general shape of the 
graph remains. Further, since the graph of an exponential function 
is the graph of the corresponding logarithmic function with axes , 
transposed, one exponential graph is obtainable from another ex-i 
ponential graph by a process of stretching ** or “ contracting ’’in 
the direction of Ox. This fact was stated above ; it has now been ^ 
justified. 

The graph of a logarithmic function shows the following properties. 
There is no logarithm of a negative number ; the logarithm of a 
positive number less than unity is negative ; the logarithm of the 
number 1 is zero ; the logarithm of a number greater than unity is 
positive, increases and tends to infinity as the number increases. 
These properties are obtainable, of course, directly from the defini- 
tion of a logarithm but they are particularly clear from the graph. 

It is interesting to compare the three function types we have now 
considered. In their simplest forms, we can write 
f/ = log„a;; f/=a*, 

where a and a are greater than unity and where n is taken as positive. 
As X tends to infinity, so does the value of each function. But it 
can be shown that log^x tends to infinity more slowly than and the 
latter, in its turn, tends to infinity more slowly than a®. So, for a 
large value of a:, the functions ascend in order of magnitude : 

The proof of this statement can be given as follows. 

If h is positive and if A; is a positive integer, the Binomial Theorem 
(proved in text-books on algebra) gives 


Let = where h is positive and suppose that the number x lies 
between the integers k and [k 4- 1), i.e. k <,x<k-{- 1. Then 


V-= 

’ X 


(1+A)» ^ (1+A)* ^ 1+AA hh 
■Jx ^ 


Hence, “ is greater than \}i^k and the latter tends to irinnity as i. 


and so a;, tends to infinity. So 


a* 

>.QO 

X 


as 


a:->QO , 
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Now 



where 


X 


n 


and this expression tends to infinity as y, and so as x, tends to 
infinity. 

X^ oc* 

Finally ^ =n — where z^nlo^^x, 

Iog,a; 2 

and this tends to infinity as z, and so as x^ tends to infinity. 

Cl* X** 

So >00 and = >oo as a;->oo , 

x^ log^x 

and log^a;, and a® must be in ascending order of magnitude if x 
is given any large value. 

The result can be illus- 
trated by comparing the 
three functions 
y=log2a;; t/=x*; y = 2® 
graphically. Fig. 64 shows 
the graphs of the three 
functions plotted on the 
same scales. It is clear that 
the graphs rise to the right 
at different rates and that 
logjX <a;* <2* 
for any large value of x. 



9.4 Logarithmic scales and graphs. 

The introduction of logarithms enables us to extend the process 
of representing a fimction graphically by measuring the variables, 
not on the familiar “ natural ” or numerical scales, but on what are 
called “ logarithmic ” scales. The advantages and applications of 
this radical change in the graphical method are described m the 
present and following sections. 

A variable x is measured according to some definite scale and a 
series of its values is represented by points on an a^ Ox. The usual 
method is to take distances along Ox from a base point 0 as equal, 
or proportional, to the values of x plotted, a method which gives the 
natural scale for x. Now suppose that the points are plotted at 
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distances along Ox which are equal, or proportional, to the logarithms 
of the values of x concerned, the point marked x on the axis being a 
distance logo; from the base point. It is convenient, but not 
essential, to take logarithms to the base 10 from this purpose. A 
logarithmic scale for x is thus obtained. 

The important characteristics of the logarithmic scale as compared 
with the natural scale are best introduced by considering some 
numerical examples. Of the two sequences of numbers 

100, 150, 200, 250, 300, ... , 

100, 150, 225, 337-5, 506-25, ... , 

the first shows a regular increase of 50 units and the second a regular 
increase of 50 per cent, from one number to the next. On a natural 
scale, the points representing the first sequence appear at eqiial dis- 
tances from each other and those representing the second sequence 
at increasing distances along the axis. The logarithms are 

2, 2-176, 2-301, 2-398, 2-477, ... , 

2, 2-176, 2-352, 2-528, 2-704, ... . 

Hence, on a logarithmic scale, it is the second sequence that gives 
points at equal distances from each other and the first sequence 
shows points at decreasing distances along the axis. In the same 
way, the two decreasing sequences of numbers 

100, 80, 60, 40, 20, ... , 

100, 80, 64, 51-2, 40-96, ... , 

show decreases of 20 imits and of 20 per cent, respectively. The first 
sequence is represented by points at equal distances on a natural 
scale and at increasing distances (to the left) on a logarithmic scale. 
The second sequence corresponds to points at equal distances on a 
logarithmic scale and at decreasing distances (to the left) on a 
natural scale. 

' It appears, therefore, that equal distances between points on a ' 
V natural scale indicate equal absolute changes in the variable, and ; 
equal distances between points on a logarithmic scale indicate equal 
yproportibrial changes in the variable. This property is easily verified 
in general. If x^, x^y x^ are values shown by points at equal distances 
on a natural scale, then 
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and the variable increases by equal absolute amounts. The same 
property on a logarithmic scale impUes that 

log - log X, = log Xj - log Xi, 


i.e. 


i.e. 


log^=log^ 

X. x^ 


Xt 


9 


and we have equal proportional changes in the variable. The obverse 
of this property is that equal proportional increases are shown on a 
natural scale by points at distances from the base point which 
increase more and more rapidly, while equal absolute increases are 
shown on a logarithmic scale by points at distances along the axis 
which increase more and more slowly. 

If points are plotted from a ftmction y =f(x) by taking x along the 
horizontal axis Ox on a natural scale and by taking y along the 
vertical axis Oy on a logarithmic scale, we obtain the semi-logarithmic 
graph of the function. The graph shows a curve drawn through the 
plotted points and the varying height of the curve shows, not the 
variation of y with a;, but the variation of log y with x. The curve 
is obtainable, of course, as the natural graph of log y as a function 
of X. If the curve is seen to rise through equal heights over certain 
ranges of values of x, then the value of y is subject, not to equal 
absolute increases, but to equal proportional or percentage increases. 
The use of semi-logarithmic graphs is thus clear. If we wish to 
compare percentage changes in the value of one variable as the 
other variable increases steadily, we plot the relation between the 
variables on a semi-logarithmic graph. If it is absolute changes 
that are important, we plot the relation on a natural graph. 

Many statistical time-series are appropriately plotted as semi- 
logarithmic graphs. Time-intervals are taken, on a natural scale, 
on the horizontal axis. The variable of the time-series, to be plotted 
on the vertical axis, may be such that proportional changes in its 
value are important. Trade, consumption or employment figures 
for an increasing population are cases in point. * In such ‘cases, a 
logarithmic scale is used for the vertical axis and a semi-logarithmio 
graph is drawn. An example is given in the following section. 
Another application can be quoted in which the logarithmic scale is 
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used for the independent variable measured along the horizontal 
axis. Suppose y represents the number of families with incomes 
over £x. Then the relation of y to a: (the ‘‘ cumulative ’’ frequency 
distribution of incomes) is best plotted with t/ on a natural and x on 
a logarithmic scale. The characteristic that is most important is 
the variation of numbers for proportional changes in income.* 

The logarithmic graphing of a function can be carried one stage 
further by taking both variables on logarithmic scales, in which case 
a logarithmic graph is obtained. The curve representing the function 
y =f{x) now shows the variation of log y as log x changes and corre- 
lates proportional changes in y with proportional changes in x. So, 
if a curve on a logarithmic graph shows equal rises for equal moves 
to the right, then the result of increasing x by equal percentage 
amounts is to increase y by equal percentage amounts. The 
logarithmic graph is thus useful when proportional changes in both 
variables are important. For example, variables which are related 
for very small and, at the same time, for very large values are not 
easily shown on natural scales. Taking logarithmic scales, how- 
ever, reduces the large variations to reasonable proportions while 
magnifying the small variations.! 

togaiithmic graphs are extremely useful, though perhaps not well 
understood, in the statistico-economic field. Pareto’s Income Law, 
for example, asserts that a logarithmic graph shows a straight line 
relation between x and y, where y is the number of persons with 
incomes over fcr.J As income rises proportionally, the number of 
persons with that or higher income falls off in proportion. The main 
use of logarithmic graphs, however, is in the statistical correlation 
of economic variables by means of scatter diagrams (see 2.2 above). 
When we correlate, for example, the populations and unemployment 
rates in urban and rural areas of varying size, the variations are 
great and percentage changes are more significant than absolute 
changes. It is thus appropriate to plot the scatter diagram by 
logarithmic graphing. An example of a correlation scatter -diagram 

* See Gibrat, Lea InigaliUa jSconomiquea (1931) and Allen and Bowley, 
Family ^Expenditure (1936), pp. 63-6 and Diagram XIII. 

! This follows since log x increases more rapidly than x at first and then 
more slowly than x (see the graph of the logarithmic function). 

J See Pareto, Coura d'kconomie politique (1897), Book 3, Chapter 1 and 
Bowley, Elementa of Statiatica (4th Ed. 1920), pp. 
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plotted on logarithmic scales is given in the following section. 
Another important example of the use of a logarithmic scatter 
diagram arises in the statistical determination of demand or supply 
relations from the correlation of market data of prices and quantities 
demanded or supplied. The important thing here is the proportional 
change in price or quantity from one year to another, or in one year 
as compared with the “ norm 

9.5 Examples of logarithmic plotting. 

The methods of logarithmic graphing, described above in general 
terms, are most clearly appreciated by giving actual examples of 
their use. The two cases considered in detail below illustrate very 
different apphcations of the methods. 

Ex. 1. Value of total imports^ U.K,, 1820-1897. 


C5entre 

year 

Imports (£Mn.) 
average for 3 years 

Centre 

year 

Imports (£Mn.) 
average for 3 year* 

1821 

31 

1860 

202 

24 

39 

63 

260 

27 

43 

66 

280 

30 

47 

69 

298 

33 

47 

72 

352 

36 

64 

76 

373 

39 

63 

78 

376 

42 

66 

81 

407 

46 

77 

84 

396 

48 

91 

87 

367 

61 

107 

90 

428 

64 

140 

93 

410 

67 

172 

96 

437 


The variation in the value of U.K. imports, as given in the above table, 
is represented graphically in Fig. 65 by two different methods. The first 
graph takes a natural scale for the measurement of import values and the 
second graph a logarithmic scale. In each case, the years are measured 
along the horizontal axis on a natural scale and each import value is 
plotted at the centre of the averaged period. 

It is seta that changes in import values are almost imperceptible, on 
the natural scale, in the early years and very marked in the later years. 
But this hides a very important fact. The relative changes from one year 
to another are roughly of the same importance in the early years as they 
are in the later. The Icjgarithmio scale brings out this fact, the variations 
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in the height of the second graph being of comparable magnitude at the 
beginning and end of the period. These variations represent relative 
changes in import values. For example, imports were valued at £65Mn. 
per year in the period round 1842 and £77Mn. in the period round 1845. 
The increase was £12Mn. or 18*5 per cent, of the earlier figure. In the two 
periods centred at 1887 and 1890, the increase was £61Mn. or 16-6 per 
cent, of the 1887 figure. The natural graph shows the second increase as 
roughly 5 times the fiirst. The semi-logarithmic graph shows the second 
percentage increase as less than the first. For most purposes, the percen- 
tage increases are the more interesting and the semi-logarithmic graph is 
to be preferred. 

Ex. 2. Number of insured workers in employment, 1929-1935. 


Industry 

Employment 

per cent, increase ( + ) or decrease ( - ) 

July 1929 to July 1932 

July 1932 to July 1935 

1. Coal-mining - 

. 

~29-6 

+ 61 

2. Bricks and tiles 



+ 3L5 

3. Pottery 


-231 

+ 20-3 

4. Iron and steel 


-401 

+ 460 

6. Shipbuilding - 


-68-3 

+ 37 3 

6. Engineering, general 


-27*9 

+ 19-9 

7. „ electrical 


- 2-8 

+ 11-2 

8. „ marine 


-67-4 

+ 480 

9. Electrical apparatus 


-f 36-6 

+ 31-7 

10. Motors 


- 13-8 

+ 321 

11. Cotton 


-26*8 

- 1-3 

12. Wool - 


-15'6 

+ 121 

13. Silk . - - 


- 16-3 

+ 24-6 

14. Linen - - - 


. 34-9 

+ 20-6 

16. Leather 


- 91 

+ 141 

16. Clothing 



+ 3*8 

17. Boots and shoes - 


- 8-2 

+ 6-2 

18. Food, etc. 


- 1-8 

+ 60 

19. Building 


- 17-7 

+ 350 

20. Public works 


+ 31 6 

- 13-2 

21. Road transport 


+ 6-6 

+ 7-7 

22. Distribution 


+ 8*6 

+ 4-7 

23. Entertainment 


+ 17-9 

+ 26-8 

24. Hotels - - . 


+ 4-4 

+ 160 

25. Local^govemment 


+ 10-9 

+ 6-6 

All insured industries 

- 

- 9*4 

’ +11-9" 


The above table provides a picture of the decline in employment be- 
tween 1929 and 1932 iia various important industries and of the corre- 
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sponding recovery between 1932 and 1935.^ A graphical representation 
can be given by taking per cent, changes in the first period along the 
horizontal axis and per cent, changes in the second period along the 
vertical axis. One point is then plotted for each of the 25 industries, the 
height of the point indicating the per cent, change 1932-5 and the hori- 
zontal distance the per cent, change 1929-32. For convenience, the 
changes are referred to 100 as base in each case. Thus the decline in the 
coal-mining employment in the first period is represented by 704 (29*6 
per cent, decrease) and the recovery in the second period by 105*1 (5*1 per 
cent, increase). The point representing the industry has co-ordinates 
(70*4, 105*1). It remains to determine whether natural or logarithmic 
scales are the more appropriate. It is clearly convenient to represent 
an industry which has recovered completely (1935 employment == 1929 
employment) by a point with distance to the left of the 100 mark equal 
to height above the 100 mark. This is achieved by taking logarithmic 
scales. Suppose, for example, that emplo 3 nnent falls to four-fifths its 
1929 value in 1932 and then recovers completely by 1935. This implies a 
20 per cent, decline and a 25 per cent, recovery in the two periods, i.e. the 
plotted point is (80,125). Now 

+ log = 0, 

i.e. log 80 - log 100 = - (log 125 ~ log 100). 

So, on logarithmic scales, the point (80,125) is as much to the left of the 
100 mark as it is above the 100 mark, as required. 

A logarithmic scatter diagram of the data is shown in Fig. 66, the points 
representing the decline and recovery of the various industries. Most 
industries are shown by points in the N.W. quadrant, a decline followed 
by a recovery. The broken line, sloping downwards at 45®, corresponds 
to complete recovery, so that industries shown by points below the line 
have failed to recover completely and industries shown by points above 
the line have more than recovered their employment position. Six of the 
industries have increased employment in both periods, as shown by the 
points in the N.E. quadrant. The cotton industry, shown by a point in 
the S.W. quadrant, has a progressive decline in employment. Finally, 
Public Works employment has changed, as we should expect, in the 
opposite sense to the majority of industries and is shown by a point in 
the S.E. quadrant. Since the point is above the broken litie, the later 
decline Vas not sufficient to reduce employment to the 1929 level. 

* No correction has been made for the increase in the total insured popu- 
lation over the period 1929-35. The picture thus shows the recovery in a 
somewhat too favourable light. 
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It is to be noticed that we have marked our logarithmic scales 
according to equal intervals of the variable. In the vertical scale 
of Fig. 65, for example, the markings are at 10, 20, 30, ... (£Mn. of 
imports). The characteristic appearance of the logarithmic scale is 
then that the markings get closer and closer as we proceed up the 
scale.* Another point of importance is that a logarithmic scale 
avoids any difficulty about choice of zero mark or of units, a difficulty 
which is evident in using natural scales. In fact, a fixed distance 
on the logarithmic scale represents a given per cent, change no 
matter where we are on the scale and we have never to refer distances 

NUMBER OF INSURED WORKERS IN EMPLOYMENT, U.K., 1929-35. 



Fjo. CG. 

Jas in the case of the natural scale) to the fixed origin or zero mark. 
The logarithmic scale, in this respect, has a clear advantage over the 
natural scale. A minor difficulty of the logarithmic scale is that only 
positive values of the variable can be plotted (since negative numbers 
have no logarithms). An adjustment can always be made, however, 
to avoid negative values, as is seen in Ex. 2 above, where negative 
percentages are converted to positive numbers below 100. 

A functional relation between x and y can be represented, there- 
fore, in,, tffi'ee different ways on a diagram, as a natural, a semi- 
l^arithmic or a logarithmic graph. The differences betv^een the 

♦ Special graph paper, marked in this way either for semi -logarithmic or 
for logarithmic graphing, is provided commercially. But ordinary graph 
paper can be used and adapted for the purpose by plotting distances directly 
from the logeirithms. « 



228 MATHEMATICAL ANALYSIS FOR ECONOMISTS 


three methods are well brought out by considering what relation is 
represented by a straight line in each case. On natural scales, 

yz= 00:4-6 

is the relation represented by a straight line graph. If y is plotted 
on a logarithmic scale against o: on a natural scale, the relation 

j/=a6* 

is represented by a straight line graph. For, we can write the 
relation as log t/ = log a + a; log 6, i.e. logy is related linearly to x. 
Finally, on a logarithmic graph, the relation 


y=iaofi 

appears as a straight line. T^or, the relation can be written as 
log y= log a + 6 logo; and logy is related linearly to logo;. So, an 
exponential function appears as a straight line on a semi-logarithmio 
graph and the line’s gradient is the logarithm of the base of the 
exponential. A power function is shown as a straight line on a 
logarithmic graph and the gradient is the constant index of power. 
If y is the number of incomes over £x, for example, Pareto’s Law is 

y = — , where a and m are constants. Plotting on a logarithmic 

graph, we obtain a downward sloping straight line with a numerical 
gradient equal to the constant m. 


9.6 Compound interest. 

A sum of £100 is invested and accumulates at compound interest 
at the rate of 4 per cent, per year. If interest is added yearly, then 

£100 4- £4 = £104 = £100 (1*04) 
is the amount at the end of the first year. Again, 

£104 4- 104 = £104(1-04) = £100 (1-04)* 

is the amount at the end of two years. Similarly, £100'(l-04)* is the 
amount after three years, and so on. In general, if £y is the amount 
of the investment after x years, then 

y=:100(l-04)». 

Suppose now that interest is added twice a year. Then, with a rate 
of 4 per cent, per year, 2 per cent, ia added in each first half-year 
and another 2 per cent, in each second half-year. It follows, as 
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before, that yOO amounts at the end of successive half-years to 
sums given by the sequence 

£ 100 ( 1 - 02 ), £ 100 ( 1 - 02 )*, £ 100 ( 1 - 02 )*, .... 

Hence, at the end of x years, the amount is £y where 

y= 100(1-02)**. 

These results can be generalised at once. If £a is invested at 
compound interest at lOOr per cent, per year compounded yearly, 
then the amount after x years is £y where 

j/=a(l-fr)». 

If the interest is added n times a year, then 



In this result, which includes the previous one as a particular case, 
it is to be understood that a; is a discontinmus variable, taking values 

which are multiples of ~ . Discontinuity is an essential feature of 

this compound interest problem. 

The dependence of y on the parameters indicating the interest rate 
and the frequency of compounding interest is to be noticed. The 
amount y, after any period, is clearly larger the higher is the interest 
rate. Further, the amoimt is larger when interest is compounded 
twice a year than it is on yearly compounding. For, 

(l+ir)* = l+r + Jr*>l+r, 
i.e. a ( 1 H- ir)*®> a ( 1 + r)*. 

In general, the more frequently is interest added, the larger is the 
amount of a given sum at the end of any period. 

The amount of an investment increases over time in what is called 
a ** geometric progression each amount being a fixed multiple of 
the previous year’s amount. Analytically, we can express this 
growth at an ever increasing rate by the exponential function 

y=ab^“, where 6 = 1+- is a constant greater than unity. The 

growth is shown by the heights of successive points on a certain 
exponential graph, the points being spread out at equal distances 
along the horizontal time-axis. For example, the heights of points 
on the graph of y=2®«(shown in Fig. 62) at abscissa a: = l, 2, 3, ... 
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represent the amounts of £1 after successive years when interest is 
added yearly at 100 per cent. It is clear that semi-logarithmic graph- 
ing is appropriate to this problem. The compound interest growth 

curve is then a straight line with gradient log 6= log . The 

percentage rate of growth is a constant fixed by r and n. 

A simple problem illustrates the way in which logarithms are to 
be used in dealing with compound interest growth : 

A National Savings Certificate (1935 issue) costs 15s. and realises 208. 
at the end of 10 years. Assuming that interest is added four times a year, 
it is required to find the rate of interest represented by this growth. If 
the interest rate is lOOr per cent., then 

i/ = 15(l-hir)** 

is the amount (in shillings) after x years. But y = 20 when x = 10. 

So. 20 = 15(l + ir)^^ 

i.e. 401og(l+Jr)=log20-log 15=0 12494, 

i.e. log(l+ir) =0-0031235, 

i.e. * l+ir = 1-00722. 

So lOOr = 2-89 approximately and the rate of interest per year is 2*9 per 
cent, correct to one decimal place. 

The growth of an investment when interest is added at definite 
intervals is a function of a discontinuous variable. It remains to 
consider what meaning can be attached to a notion of growth at 
continuous compound interest. Our problem now is to examine the 
result of letting n, the number indicating the frequency of compound- 
ing interest, take larger and larger integral values. 

To start with a simple case, £ ^1 is the amount of £1 at the 

end of a year when interest is compounded at 100 per cent, per year 
n times in the year. The values of this expression for certain values 
of n are 


n 

1 

10 

100 

1000 

laooo 

‘(i+i)" ‘ 

2 

2-594 

2-704 

2-717 

2-718 








the values being given correct to three decimal places. It is clear 
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that ^1 tends to a definite limit, in the neighbourhood of 2*718, 


as n tends to infinity. It can be shown, by rigid methods, that this 
is correct * and the limiting value is denoted by the letter e. So, 


Definition : 


e = Lim 

n->QO 



It is possible to show that e, which must be a pure number, is very 
similar to the familiar number n and cannot be expressed in fractional 
form or as a terminating decimal. The value of e can be found, 
however, correct to any given number of decimal places by giving 


n a sufficiently large value in 



So, to five decimal places, 


6 = 2*71828. 


Our definition of e is such that the amount of £1 at the end of one 
year, when the interest at 100 per cent, is added more and more 
frequently, approaches the value £e. This is only one of the 
many uses of the number e which is of very great importance in 
mathematical analysis. 

Returning to the general case of compound interest growth, 




1 + 


m 




as n-^oo , 


since 


1 as m, and so as n — rm, tends to infinity. Also, 

. . 1 . 

since the discontinuous variable a; is a multiple of -, it tends to 

become less and less discontinuous as n increases. Hence, as n tends 
to infinity and interest is added more and more frequently, the 
compound interest formula tends to assume the form t/=ac*’* and 
the variable x tends to become continuous. We have now derived 
a concept of continuous compound interest as the result of a limiting 
process in which interest is compounded more and more often. Our 
result is : 


The amount of £a after x years when interest is compounded con- 
tinuously at the nominal rate of lOOr per cent, per year is given by 

y=ae’’*. 

• See Hardy, /Jure Mathernaiics {3rd Ed., 1921), p. 137. 
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This compound interest formula, dependent on a continuous variable 
X, is an abstract one. But, though interest can never be compounded 
continuously in actual practice, the formula can be taken as a con- 
venient and approximate representation of the actual state of affairs 
when interest is compounded frequently. 


9.7 Present values and capital values. 

A sum of £a is due x years hence and the rate of interest and the 
frequency of its compounding are known. Then we can determine 
the sum to invest now so as to produce the given sum of £a at the 
end of X years. This sum is called the present value, or the discounted 
value, of Sja available x years hence. 

If interest is compounded once yearly at lOOr per cent., then £y 
is the present value of fa available x years hence provided that 


So 


y(l+r)*=a. 

a 


Similarly, if interest is added n times a year at lOOr per cent., we have 


a 



Finally, if interest is added continuously at lOOr per cent., then 

ye^^=a, 

i.e. y—ae-^*. 

The present value iy of a given sum at a given date depends on 
the parameters r and n indicating the interest rate and the frequency 
of compounding. The present value is smaller, the higher is the 
interest rate and the more frequently is interest compounded. 

The uses of a computation of present value are fairly obvious. If 
a dealer has a claim on fa in a; years’ time, he can sell his claim now 
to another person, not for the full fa, but for the present value of 
this sum calculated at the current rate of interest. Further, we can 
find the total present value of a claim on a whole series of sums due 
in successive yeaiss. The series 

fao, ••• 

may represent the values in the current and in m successive years of 
the crops obtainable from a piece of land, of the outputs of a given 
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machine or plant or of the incomes due to a given individual. We 
have, in fact, a given stream of crop values, of output values or 
of incomes over time. The present value of the stream, if interest 
is added yearly at lOOr per cent, per year, is given by the sum 

Clm. 

This sum can be called the capital value of the land, machine or 
income stream in question. It represents the sum which must be 
invested now to produce incomes of fuj, ia^y ... ia^ in successive 
years. It is to be noticed that the capital value of an output or 
income stream depends, not only on the items of the stream and on 
the number of years that the stream flows, but also on the interest 
rate that is taken. One and the same income stream has different 
capital values when different interest rates are current. Simple 
examples can be given to illustrate this fact. 


Shares in a mining company are expected to produce dividends of £40, 
£32, £24, £16 and £8 in the present and in the four following years, and 
to be worth nothing thereafter. If interest is added once yearly at 5 per 
cent., the present or capital value of the holding is 


32 24 16 


105« 


)- 


£ 112 - 6 , 


the calculation being made with the aid of logarithms. A similar com- 
putation shows that the capital value is £116*2 if the interest rate is 
only 2\ per cent. These are the two sums which can be invested now 
to produce, at the respective interest rates, the given income stream over 
the five years. 

Shares in a new trading company are expected to produce dividends 
of £1, £2, £4 and £6 in the present and m the three following years. In the 
fourth year, it is expected that the shares can be sold for £120. The 
capital value of this holding, taking interest as added once yearly at 5 per 
cent, per year, is given by 

2 4 6 120 \ 

But, if the Interest rate is 2\ per cent., the capital value is found to be 
£121*1. This holding is worth less than the previous one if the rate of 
interest is 5 per cent, but worth more if the rate of interest is 2J per cent. 
The relative capital valuation of the two holdings is reversed by the 
change in the rate of interest. 
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In conclusion, we can consider a simplified economic problem in 
the field of capital and interest. Let £x be the income derived by 
an entrepreneur this year and £y his income next year. The relation 
between these two incomes is given by technical considerations as 
y=f(x) (see 6.6 above). The entrepreneur wishes to arrange his 
resources to give incomes which correspond to the largest possible 
present value. If the rate of interest, compounded yearly, is fixed 
at lOOr per cent., the present value of £x now and £y next year is 

F=a:+-i^=a: + ® • 


This is a maximum if - 7 -- = 0 and <0, i.e. if 
ax dx^ 

= o and ®<0. 

1-fr 1+r 

The first condition shows that r equals the expression { -f'(x)} - 1 
which, as the numerical gradient of the transformation curve 
y=f(x) reduced by one, is called the marginal rate of return over 
cost.* At the incomes for maximum present value, therefore, this 
marginal rate must equal the fixed rate of interest. The second 
condition is that/"(x)<0, i.e. the transformation curve must be 
concave from below, a condition which is satisfied in the ‘‘ normal ** 


9.8 Natural exponential and logarithmic functions. 


The introduction of the number e=Lim 




2-71828, 


provides us with the most important cases of the exponential and 
logarithmic functions. When the base of the function is taken as e, 
it is called a “ natural exponential or logarithmic function. The 
standard forms of the natural functions are y=e® and y=log^x and 
the curves representing them are similar to those already indicated 
(in Figs. 62 and 63) for the base 2. 

A more general form of the natural exponential function is 

where crand b are constants. But this function can be derived quite 
easily from the standard form y=e*. The method of derivation is 

♦ See Fisher, T?ie Theory of Interest (1930), pp. 169 et seq, €«id pp. 614-5. 
See also. Examples VI, 31. ^ 
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best described in a particular case and in diagrammatic terms. The 
curve y = is obtained by stretching the curve = in the hori- 
zontal {Ox) direction so that each point on the former is twice as far 
from Oy as the point with 
the same height on the lat- 
ter. This is illustrated in Fig. 

67. The curve = is 
then obtained from the curve 
= by stretching in the 
vertical {Oy) direction so that 
all ordinates are doubled, 
as shown in Fig. 67. Two 
stretching processes are thus 
needed to transform the 
standard curve i/ = e® into 
the curve y = In gen- 

eral, if a and b are positive, 
the curve y — ae^^ can be 
derived from the curve y = in two stages. The standard ex- 
ponential curve is first reduced in the Ox direction in the 
ratio 6:1, i.e. contracted if 6> 1 and stretched if 6<1. The cuiwe 
so obtained is then expanded in the Oy direction in the ratio a ; 1, 
i.e. stretched if a > 1 and contracted if a < 1 . As a result, the ordinate 



of the curve y=ae^^ at abscissa t is times the ordinate of the curve 
^ 6 


y=:e® at abscissa x* Exactly similar remarks apply to the relation 
between the natural logarithmic function y =a log, 6a:, where a and b 
are constants, and the standard form y = log,x. 

AU problems of growth at compound interest added continuously 
are described by means of a natural exponential function. An 
investment of £a at lOOr per cent, continuous compound interest 
increases over time according to the law y=ae*‘*. The particular 


* If the constants a and 6 are allowed to take negative values, a further 
modification^ of the standard exponential curve is needed. The curves 
y = and y — ae^^ differ only in that the abscissa {-x) gives th© same 
ordinate of one curve as the abscissa x of the other. One curve is the reflection 
of the other in Oy. One curve falls and the other rises from left to right. 
Similarly, the curve y=— is the reflection of the curve y = ae^* in Ox, 
Negative values of o and 6, therefore, require the reflection of the standard 
exponential curve in one pr other of the axes. 
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shape of the curve representing this growth depends, as we have 
seen, on the values of the constants a and r. The larger the 
interest rate, the steeper becomes the curve and the faster becomes 
the growth of the investment. The larger the original sum in- 
vested, the higher becomes the curve and the larger the amount of 
the investment at any time. 

A somewhat different use of the curve can be noticed. 

Any point {x, y) in the positive quadrant of the plane represents a 
situation in which Zy is available x years hence. If this point lies 

on the curve y=ae^^y then 
the present or discounted 
value of this sum is £a if the 
interest is reckoned continu- 
ously at lOOr per cent. The 
curve thus connects all points 
representing situations with 
the same present or dis- 
coimted value at a given 
rate of interest and, for this 
reason, it can be described 
as a discount curve, 

Fio. 68. A system of discount 

curves is derived by 

fixing the value of r and taking a as a parameter. Certain curves of 
the system are shown in Fig. 68, in the case where the interest rate 
is 10 per cent, (r = 0* 1). One curve of the system is obtained from any 
other by stretching or contracting in the vertical (Oy) direction and 
one curve passes through any given point in the positive quadrant 
of the plane. The discount curve system provides a simple means of 
comparing the present values of different sums available at different 
future dates. Suppose that two situations are given (certain sums 
available at certain dates) and represented by points in the plane Oxy, 
If the points are on the same discount curve, then the present values 
are equal and given by the parameter of the discount curve. If one 
point ii^ on a higher discount curve than the other, then the present 
values are different and the first situation corresponds to the greater 
present value. It is necessary, of course, that the rate of interest 
should be known and fixed ; the discount curvedaystem is given only for 




EXPONENTIAL AND LOGARITHMIC FUNCTIONS 237 


one rate of discounting. The whole system changes when the rate is 
changed, the curves becoming steeper for larger rates of interest.* 
A discount curve y=ae*'® becomes a straight Une log t/=log a+fx 
when it is plotted on a semi-logarithmic graph in which the vertical 
scale for y is based on natural logarithms. The line slopes upwards 
with a gradient equal to the interest rate r. The discount cuiye 
system of Fig. 68 then reduces to a set of parallel straight lines. 

In conclusion, it is easily shown that an exponential or logarithmic 
function to a base other than e can be expressed as a natural ex- 
ponential or logarithmic function. If 6 is a positive number, the 
definition of a logarithm enables us to write 

6 


The general exponential fimction then appears 

i.e. as a natural exponential function. Further, since 

- lOQ.X 

the general logarithmic function y=a log^co: can be written 

i.e. as a natural logarithmic function. 

There is, therefore, no need to consider exponential or logarithmic 
functions other than those to the natural base e. In practical work, 
it is convenient to take common logarithms and powers of the base 
10. But, in theoretical work, it is always found preferable to use 
the natural base e. In this case, for example, the derivatives of the 
exponential and logarithmic functions appear in their simplest form 
(see 10.1 below). Natural exponential and logarithmic functions are 
thus assumed in all our theoretical work and, if we write a logarithm 
without specification of the base, it is to be imderstood that the base 
is c. The step from the practical logarithms to the base 10 to 
theoretical logarithms to the base e is, however, a very simple one : 

- log. 10 . logio* = ( 2-3026) logi,x, 

the numerical multipher being given correct to four decimal places. 


* Another system of discoimt curves is obtained when a is fixed and r is 
taken as a peuameter. This sytem enables us to determine at what interest 
rate £y available in x yearp has a given present value of £a. 
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^ ^ EXAMPLES IX 

Exponential and logarithmic functions 

X* Draw a graph of y = 10*. Read off the values of x for y = 0*5, 1*2 and 
8*9 and oheck from tables of logarithms. 

2. Use logarithmic tables to construct a graph of y = logif^x. Add the 
graph of y = x- 2 and so solve \ogi^X’^2=x approximately. 

8. Find, with the aid of logarithmic tables, approximate solutions of the 
equation 2*. 3~®= 10 and of the equation 2®* = 7. 

4. Illustrate the fact that the logarithm of a non-prime number can be 
reduced to logarithms of primes by showing that 

logic 1176 = 3 iogio2 + logio3 + 2 logio7. 

Check the result by looking up the logarithms in tables. 

6. By expressing the left-hand side as a single logarithm, show that 
ilogV-log¥ + *log¥ = 0. 

Express y explicitly in terms of a: if it is given that 

21ogy + log(a?- l)-log(a?-|- 1) = 0. 

X / 1 + 2x 

6. Express the logarithm of /W - — ^ as sums and differences of 

1 -l- a? \ 1 “f* ox 

logarithms of simpler expressions. Hence, find the value of this expression 
when a; = 2*4. 

7. Show that (x + Vx* - 1) (x - *fx* - 1) = 1 and deduce that 

log(x-t- s/x* - 1)= ~log(a;-- sfx* - 1). 

8. Indicate why ^ logso; and x* 3~* both tend to zero as x tends to infinity. 

^ . xlogjX ^ 

Deduce that — >-0 as x-^co . 

u* 


9 . Yearly production of bricks, U,K,, 1816-1849. 


Year 

Bricks 

Mn. 

Year 

Bricks 

Mn. 

Year 

Bricks 

Mn. 

Year 

Bricks 

Mn. 

1816 

673 0 

1826 

1948-8 

1834 

1152-4 

1842 

1271-9' 

17 


26 

1350-2 

35 

1349-3 

43 

1168-9 

18 


27 

1103-3 

36 

1606-1 

44 

1420-7 

19 


28 

1078-8 

37 

1478-2 

45 

1820-7 

20 

949-2 

29 

1109-6 

38 

1427-0 

46 

2039-7 

21 

899-2 

30 

1091-3 

39 

1668-7 

47 

2193-8 

22 


31 

1125-4 


1677-8 

48 

1461-0 

23 

1244-7 

32 

971-9 

41 

1423-8 

49 

1462-7 

24 

• 


33 

1011-3 



• 



Data from Shannon, Bricks — A Trade Index, 1786-1849, Economica, 1934. 
Represent the above time-series on two graphs, one showing the number 
of bricks on a natural scale and the other on a logarithmic scale. Which 
graphical representation is to be preferred? 
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10. If y is the number of persons with incomes over £«, the following table 
is obtained for super-tax payers, U.K., 1911-2 : 


®(£000) 

6 

10 

16 

20 

26 

35 

46 

y 

11,564 

4143 

2114 

1327 

889 

507 

321 


Plot the data on a logarithmio graph and show that the plotted points lie 
close to a downward sloping line. Deduce that Pareto’s Income Law is 
approximately satisfied {y = alx”^) and estimate the values of the constants 
a and m. (See Bowley, Elements of Statistics^ 4th Ed. 1920, p. 347.) 


11, Plot the price and consumption data of Examples I, 30 as a scatter 
diagram on logarithmio scales for both variables. 


12. Number of Insured Workers in Staffordshire, at July, 1 935, and percentage 
Unemployed, at May, 1936. 


Area 

Insured 
(OOO’s) 
July 1936 

Per- 
centage 
unem- 
ployed 
May 1936 

Area 

Insured 
(OOO’s) 
July 1936 

Per- 
centage 
unem- 
ployed 
May 1936 

Audley 


1-7 

28-3 

Rugeley - 


3-2 

6-9 

Bicldulph - 


3-3 

25-8 

Smethwick 


36-8 

4-9 

Bilston 


16-8 

11-2 

Stafford - 


131 

30 

Brierley Hill 


10-2 

10-7 

Stoke 


120-7 

16-7 

Burton 


18*4 

8-8 

Tamworth 


8-3 

6-1 

Cannock 

• 

16-8 

7-3 

Tipton 


15-7 

7-8r 

Cheadle 


61 

27-3 

Uttoxeter 


2-7 

3-1 

Cradley Heath 


16-1 

9-8 

Walsall - 


41-9 

10-0 

Darlaston - 


10-0 

6-9 

Wednesbury 


11-6 

12-0 

Kidsgrove 


3 3 

60-6 

W. Bromwich 


22-7 

7-6 

Leek - 


IM 

14-3 

Willenhall 


11-8 

8-4 

Lichfield - 


2-6 

6-5 

Wolverhampton 

65-3 

IM 

Newcastle - 


16 3 

20-3 






Plot a scatter diagram on logarithmic scales. Is there any evidence of a 
correlation between unemployment and the size of the area? 


13. Given log, 10 = 2-3026, use tables of common logarithms to plot a graph 
of f/ = log,®. Put 1 / = 1 and read off the value of e. 

14. Plot graphs of y = e~*, y = }e** and y = Show how these curves 

can be obtained from the curve y = e*. 


16. If y = i(®® - «-*)» show that e** - 2ye* - 1 = 0. Solve this quadratic 
equation in 6* to show that ® = log(y f v^y*-f 1). Why must the other root of 
the quadratic be neglected 7 In the same way, show that y = i (®* + 

X = log (y + -- 1) are inverse functions. 


16. Show that 


y= 


0 


and «=) log 


l+y 

1-y 


are inverse funotiona. 
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17. Given logio 0 = 0*4343, use tables of common logarithms to evaluate 

approximately for a; = 0, 1, 2, 3, 4 and 6, Hence plot a graph of 
the curve y=xe~^ for positive values of x. (This is one of Pecirson’s system 
of curves for fitting to statistical frequency distributions. See Elderton, 
Frequency curves and correlation.) 

18. Using the method of the previous example, plot a graph of the curve 

for positive and negative values of x. (This is the normal curve oj 
error of statistic€d theory.) 

19. Indicate why x^e~‘* and x^e~^^ must tend to zero as x tends to infinity 

for €my finite value of n. From the graph of y = xe~^, illustrate that 0 

as x-^’co . 


Compound interest problems 

20. What is the amoimt of £100 after 6 years and what amoimt must be 
invested to realise £100 after 10 years when interest at the rate of 3 J per cent, 
per year is added (a) yearly, (6) twice yearly, (c) continuously? 

21. In how many years will cm investment double itself when interest at 
2J per cent, per year is added (a) yearly, (6) twice yearly, (c) continuously? 
What are the periods when the interest rate is 6 per cent. ? 

22. A National Savings Certificate costs 16s. and realises 20s. after 10 years^ 
Find the rate of interest involved when it is added (o) yearly, (6) twice a year, 
(c) eight times a year, (d) continuously. Show that the nominal rate is 
smaller, the more frequently is interest added. Take log^^e^: 0*4342946 and 
use Chambers* seven-figure logarithmic tables. 

23. The certificate of the previous example produces ITs. 3d. after 6 years. 
What rate of interest, added yearly, does this represent? Is it less than the 
rate over the complete 10 years? 

24. In a previous issue (1933), a certificate cost 16s. and realised 203. after 
8 years, 21s. 4d. after 10 years and 23s. after 12 years. Find the interest rate, 
added yearly, for each of these periods. Is it true, as claimed, that the present 
issue bears the same rate of interest as the 1933 issue over the 10 years period? 


25. Interest at lOOr per cent, compounded yearly is equivalent to interest 
at 100s per cent, compounded n times a year, a given sum producing the same 

amount after cmy period at the two rates. Show that r=^l+~^ -1. 


Deduce that s is less than r by an amount equal to fs* approximately when 
9 is small and n = 4. 


26. If lOOr per cent, compoimded yearly and 100/> per cent, compounded 
continuously are equivalent interest rates, show that p = log,(l -f r). Plot a 
graph of this relation to show that p is always smaller than r but approxi- 
mately equal to r when r is small. 

27. A sinking fund is formed by investing &x at the end of each year for 
m years. Show that the final amount of the fund is 

aj + »(! + r) 4- *(1 + r)* -f- ... + x(l -h r)*»^» 

when interest is added yearly at lOOr per cent, per year. By writing the sum 
of this "geometric progression (17.1 below), show that the fund will amount 
finally to £a if ^ 

Show that approximately £79-5 must be set aside each year if the sinking 
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fund (interest at 5 per cent.) is to replace a machine costing £1000 after 
10 years. (See Fowler, Depreciation of Capital, 1934, p. 131.) 

28. If the rate of interest (added yearly) will be lOOr^ per cent., lOOr, per 
cent., ... lOOr^ per cent, during the next m years, show that £a will amount 
to £a(l + fi) ( 1 + f i) . . . (1 + r^) at the end of the period. What is the present 
value of £6 due in m yecura ? 

The interest rates in successive years from now will be 4 per cent., 3 per 
cent., 2J per cent., 2J per cent., 3J per cent., .... Find the amount of £100 
after 6 years and the present value of £100 due in 4 years. 

29. Find the present value of £100 due 10 years hence when interest at 

2 per cent, per year is compounded (a) yearly, (6) continuously. 

30. A mine-owner derives an income of £2000 this year and his income 
falls by £200 in each following year imtil no income results. Find the present 
value of the income stream when interest is added yeculy at (o) 4 per cent, 
per year, (6) 6 per cent, per year. 

31. Why can £ ^ be taken as the present value of an income stream of £a 
a year for ever? (Interest at lOOr per cent, compounded yearly.) 

32. A fir plantation brings its owner nothing this year or next year. In 
the two following years the incomes are £300 and £400 and the income is 
thereafter £600 a year for ever. What is the discounted value at the beginning 
of the fifth year of the constant income stream when interest is added yearly 
at (a) 4 per cent, per year, (6) 6 per cent, per year? Find the present value 
of the whole income stream now at the same interest rates. Compare these 
values with those of Example 30 above. (See Fisher, The Theory of Interest, 
1930, pp. 133 et aeq,) 

33. Interest is added yearly at 3 per ceixt. per yeew. What is the present 
value of a perpetual income of £100 beginning two years from now? This 
income can be produced by investing £2000 in a business this year and £1200 
next year. What is the present value of the investment? Is it a profitable 
investment? 

34. Draw a graph of certain curves of the discount curve system when the 
rate of interest is fixed at 6 per cent. Use the graph to determine whether 
£220 due in 10 years has a larger or smaller present value than £160 due in 

3 years, interest being added continuously at 6 per cent, per yecu*. 

35. If r is a parameter, what is the form of the curve system y = lOOe*^ on a 
semi-logarithmic and on a natural graph ? Draw certain curves of the system 
using a logarithmic scale for y. Hence estimate at what continuous rate of 
interest £200 due in 8 years has a present value of £100. 



CHAPTER X 

LOGARITHMIC DERIVATION 


10.1 Derivatives of exponential and logarithmic fimctions. 

The introduction of the exponential and logarithmic functions makes 
it necessary to extend the list of standard derivative forms by the 
addition of the derivatives of the simplest of these functions, i/ = c* 
and y = log x. The rules for derivation then apply, exactly as 
before, to give the derivatives of more complicated exponential 
and logarithmic functions and of combinations involving these 
functions. 

Standard form derivatives are found from first principles, using 
the definition of the derivative and the properties of the function 
concerned. But the exponential and logarithmic functions are in- 
verse to each other and we have only to find the derivative of one 
of them from first principles, the other derivative following from the 
inverse function rule. The derivative of the logarithmic function 
y =log X is found more easily from the definition : 


log(a:+ 7t )-Iogx 1 

h ~h ^ 





where n denotes ^ and tends to infinity as h tends to zero. But 
the expression (l tends to the limit e as n tends to infinity. So 


i.e. 


Lun = logfl+l) 

kr^o h X \ nJ 


d 

dx 


\ogx = - 


- loge = -, 

X X 


The function inverse to y = is x = \ogy with derivative 


dx 

dy 


1 
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Hence, the inverse function rule gives the derivative 


i.e. 

= d , - 

d 

T-C® = C*. 

dx 

The derivative of the exponential or logarithmic function to a base 
a other than e can be deduced at once : 

Since 

logaa; = log„elog, x. 

we have 

^log„z = log.e^log.x=llog„«. 

Again 

a® — = 

and 


where u = x log^a and the function of a function rule is used. So 


The derivative of the logarithm of any function of x or of any 
functional power of e can be deduced from the function of a function 
rule. If u ==f{x) is a single-valued function of x, then 


d . d , du \ du 

— \ogu=z^ logii — -- - , 
dx du dx u dx 

or 

d , , fix) 

= • 

Similarly 

d d du du 

dx du dx dx * 


or 




fix) 


An important special case of the latter result is 

• d 


dx 


ae®® 


®g- 



-C-®. 
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The list of standard form derivatives, as now constituted, is 




1 > 


g<.-=o-log,o, 

i , 1, 


The following generalisations of the standard forms are also estab- 
lished : 

ax ax 


dx ~ dx' 


d . \du 

dx udx* 


where u is any single-valued function of x. 

The following examples illustrate the practical method of evalu- 
ating derivatives when the functions concerned involve exponential 
and logarithmic expressions : 

Ex. 1 . ^ ^ (2^) = 

A gX+l ^ gX+l ^ ^ J _ qX+I 

^gl-2a;^gl-2a;^(l - 2a:) = 


In general, 


3 - £«*+♦= 

dx 


Ex. 2. ^e** = c** ^ (a:*) = 2xe**. 

^e-** =e-*'^( -X*) = -2x6--*. 


^ e*‘+»*-* = ^ (** + 3x - 2) = (2x + 3) 


In general. 


gaa^+6*-fe « (2aa; 4- 
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Ex. 3. ^ a:*c* = 2a:e* + x*e* = x(x + 2) c*. 


- (a:* + 3a: - 2) e* = (2a: + 3) e* + (a:* + 3a: - 2) e* = (x* + 6* + 1 ) «•. 


In general, 


^(ox* + 6x + c)e* = {ax* + (2a + 6)x + (6 + c)}e*. 


E,.4. + 


In general. 


^ log(ax+6) =-- ** 
dx ® ax + 6 


__ d, ,ld,l„ 2 

Ei.6. 


s'og<*’^»*-2)-3TSrr2E('’’ + 3»-») 


2x+3 
x*+3x — 2 


In general, 
d 


1 I • . t . \ 2ax + 6 




Ex. 7. X* log X = 2x log x + x*-=x(l + 2 log x)* 
dx X 


(l+x)(l-2x) 


^ X* log X =x"“^(l + n log x). 


Ex. 8. ^(logx)>-2logxglog»-?i^. 
^(.ogx)-!«. 
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Ex. 9. Derivatives of the second and higher orders are to be obtained 
by successive applications of the derivation process. Thus 


d , 1 


d* , 1 

jlogx=-^; 


2 

log a: ; and so on. 


^ dx^ 

In general, the following results can be established : 

dx^ 

d^ 


x^=n{n - l)(n -2) ... (n -r + 


dx^ 




d^ , , ,,, ,2.3.4... (r-1) 

^,Ioga: = (-ir-i 


dx^ 


10.2 Logarithmic derivation. 

If yz=zf{x) is any single-valued function of a:, then 

d{\ogy) _\dy 
dx ydx* 

an important result with many practical uses. The proportional 
change in the value of the function as x increases from a: to (x4- A) is 
. fix + h)-f{z) 


fi^) 

The rate of proportional change in the value of the function at the 
point X is thus 

hf(x) h ydx dx ' 




d{\ogy) \dy 


The derivative — thus serves to measure the rate of 
dx y dx 

proportional change of the function. In diagrammatic terms, just 

dxi 

as measures the tangent gradient of the curve y —f{x) plotted on 
natural scales, so - measures the tangent gradient of the curve 


when plotted as a semi-logarithmic graph. In fact, whenever we 
consider proportional changes in the function y =f(x) as x changes, 
we use a semi-logarithmic graph and the appropriate derivative 
t^Qogy) . ~ 

- dx 

The logarithmic derivative provides the simplest method of 
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finding the derivative of a product or quotient. In general tenns^ 
if y =uVy where u and v are two single-valued functions of x, then 
log y = log {uv) = log u -f log v, 


and 


i.e. 




\dy_ 

y (lx 


Idu I dv 
udx vdx* 


u 


Again, if y -- , we have 


logy: 


and 


= log =:logU-logV, 


log y log u-^ log V, 
dx dx ^ dx ^ ^ 


i.e. 


\dy _\du Idv 
y dx udx vdx 

These two results are equivalent to the product and quotient rules 
for derivation given above and we have now established the rules 
with the use only of the sum or difference rule and the notion of 
logarithmic derivation. 

A completely general form of the product and quotient rules tan 
be given by adopting the same method of logarithmic derivation. 


Ify = 


where U2, W3, ... and v^, Vg, V3, ... are any given 


single-valued functions of x, then 

log y = logi^i + logW2 + log^3 + --- ^i-log v^-logv^- ... . 


1 du» 1 du 


^ 3 ^ _}_ ^1 _ 

dx *** V^dx V2dx 


1 dv2 1 dv^ 
dx 


1 dy 1 dui X U/CX2 

ydx Ui dx U2 dx 0.3 UUy C/2 ^2 ^3 ' 

Further, it is often more convenient, in practice, to evaluate a 
derivative by writing the logarithm of the function before proceeding 
to differentiate. In fact, we prefer to carry out the steps of the 
logarithmic derivation process rather than quote the rules for the 
derivatives of products or quotients. This remark holds particularly 
of the derivtition of a function which involves complicated products, 
quotients or powers. The following examples illustrate : 


Ex. 1 . y = 


(x + \)(x 4 ^) 
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Here log y = log a; - log (a: + 1)- log (a: + 2), 

and 1 ^^-2 

ydx X a + l x-\-2 x{x + l)(x^2)* 

dy -X -2 _ x^ -2 

dx^ (a; + 1) (a; + 2) a: (a; + 1) (a; + 2) (a; + 1)^ (a; + 2)* * 

Ex. 2. 

Here log y = 2 log a; + J log (2a; - 1) - J log (a; + 1), 

1^ 2 1 2 1 1 ^ 8a:^ + 7a;~4 

ydx x^2 2x-‘l 2a; + l 2x{x + \){2x-\y 

dy ^^^^l2x - I 8a:2+7a;-4 _ xjSx^ + lx -4) 

x + l 2x(x + l)(2x-l)~ 2{x -hi) J{x + l) {2x-l) * 


Ex. 3. 
Here 

and 

i.e« 


y where a and n are constants, 

log 2/=nlog a;- J(a;-o)^ 

- = - ~ (a; - a) =-(n +aa; - a;®). 

ydx X ' X 


dy 

dx 


= (n +aa; -a;2)a;”"'^e~i(*“®^. 


10.3 A problem of capital and interest. 

The exponential and logarithmic functions are important, as we 
have seen, in economic problems of capital and interest. A simple 
case can be considered here as an example of the type of analysis 
involved.* A good is produced at a given moment of time (< = 0) at 
a fixed and known cost. The good is not sold at once but is stored 
for a variable length of time. The value of the good increases in a 
known way as time goes on, the selling price per imit being £f{t) 
after t years. The function f{t) is assumed to be known and to 
increase with t. As concrete, though necessarily highly simplified, 
examples of the kind of problem considered, we can quote the 
obvious cases of wine and timber. New wine is bought by a dealer 
and laid down for sale at a later time. Or a given piece of land is 
put under timber at a given time and the timber is to be cut for sale 

♦ The development given here is bsised on the work of Wicksell, Lectures 
on Political Economy, Vol. I (Ed. Robbins, 1934), pp. 172-84, and of Fisher, 
The Theory of Interest (1930), pp. i61>6. o 
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at a later time. The problem is to determine the optimum time for 
selling the wine or cutting the timber. 

It is assumed that interest is added continuously at the fixed 
market rate of lOOr per cent, over the periods concerned. Then, if 
£x denotes the present value per unit of the good sold after t years, 

x=f(t)e-^ (see 9.7 above). 


It is assumed that the storage period of t years is chosen to maximise 
the net profit as discounted to the base time (/ = 0), i.e. to maximise 
the value of x since the cost of production is constant. The optimum 
storage period is given by 


dx 

dt 


0 


and 


d'^x 

di^ 


< 0 . 


Now logx=F{t) -rt where F{t)=\ogf{t), 

dt \x dt) X dt^ x^ \dt ) ^ * 

The conditions for the optimum storage period are thus 
F\t)=r and F"{t)<0. 

The first condition is that equal to the 

fixed rate of interest. At the optimum time of sale, therefore, the 
rate of proportional increase in the value of the good over time is 
equal to the market rate of interest. We find that it is the marginal 
variation of value that is the important factor. Further, 




i.e. the rate of proportional increase of value must be decreasing at 
the optimurtt time of sale. 

Fig. 69 shows the solution of the problem in diagrammatic terms. 
The curve represents the function y=f(t) on a semi-logarithmic 
diagram, log y being taken against L The zero point on the 
horizontal axis Ot is .the time (< -=0) when the initial cost of 
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production is incurred and that on the vertical axis is taken, foi 
convenience, as y=a, where fa is the fixed cost of production. 
The gradient of the tangent to the curve at the point P (at time t) 
is given by 

<^(iogy) /'(O 

dt "fit)' 


Interest is reckoned continuously at the given rate of lOOr per cent, 
per year and y=xe^* represents a system of discount curves, x 

being a parameter. Here, 
£x is the discounted value 
(at t = 0) of £y available 
at time t. The system 
appears on the diagram 
as a set of parallel straight 
fines (log y = log x-^rt) 
with gradient r and cut- 
ting the vertical axis at 
y—x. One line of the set 
passes through the point 
P and cuts the vertical at 
Q where 



OQ- 


:log X - log a =log ~ , 


since the origin is at i/=a. fx is now the discounted value of a unit 
of the good sold at time t and the optimum time of sale is 

obtained when x or OQ is a maximum, i.e. it is given by the point A 
where a discoimt fine touches the curve y—f{t). At t = the 
tangent gradient of the curve equals the gradient of the discount 

fU) 

line, i.e. =r. Further, for a genuine maximum, the curve must 


be concave from below, i.e. 


d^jlogy) 

dt^ 



This condition 


is satisfied in the “ normal ” case shown, the valuer of /(<) increasing 
rapidly at first and then more slowly. The point 5, where the 
tangent at A cuts the vertical axis, must lie above 0 if the dis- 
counted sale value of the good is to be greater than the cost 
incurred. , 
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10.4 The elasticity of a function. 

We have seen that — measures the rate of proportional 

change of the function y=f(x), the proportional change in y being 
related to the absolute change in z. Suppose, now, that we relate 
proportional changes in both variables. If x is changed from z to 

(x+A), the proportional changes in z and y are ^ and 

respectively. The average proportional change in y per unit propor^ 
tional change in a; is 

f(x + h)~f{x) 
f(x) h 

If the derivative of the function exists, the rate of proportional 
change in y for proportional changes in z is 


Lim 


J f(x) ydx' 


\f(x) h 

Denoting the logarithms of x and y hy u and v, we can write 
t; = logy, where y=f{x), where 
^ dv _dvdydx _\dy ^^_xdy 

du dydxdu ydx ydx 

The rate of change of y=f{z)y when both changes are expressed in 
proportional terms, is thus measured by 

y) ^xdy 
d (log x) ydx* 

This rate is termed the elasticity of the function at the point z and 
Ey E 

can be denoted by — f{x)* Hence i 
Ex Ex 


Definition : The elasticity of the function y=f{x) at the point x 
is the rate of proportional change in y per unit proportional change 
in z z 

^ Ey _ d(logy) _xdy 

Ex d(logx) ydx' 

* The notation adopted here is that suggested by Champemowne, A 
Mathematical Note on Substitution, Economic Journal, 1935. An alter- 
native notation is E^(y) = Eg^{J(x)). No established notation for elewticities 
is in current use. • 
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A logarithmic diagram is the appropriate representation of the 
function y=f{x) when proportional changes in both z and y are 
considered. The gradient of the tangent to the curve shown on the 

diagram is then • Hence, the elasticity of a function at 

various points is given by the tangent gradient of the corresponding 
curve when logarithmic scales are taken for both variables : the 
elasticity can be read oflF the logarithmic graph of the function just 
as the derivative is read oflF the natural graph. 

The important property of the elasticity of a function is that it is 
a number which is independent of the units in which the variables 
are measured. This is clear since the elasticity is defined in terms of 
proportional changes which are necessarily independent of units. 
More formally, if units are changed, so that the new measures of z 
and y are z' =Xz and y' respectively, then 

z* dy' _ Xzd(iiy) ^Xzfxdy _zdy 
y* dz' fiyd{Xz) fjiyXdz ydz* 

i.e. the elasticity of the fimction is unaltered. 


10.5 The evaluation of elasticities. 

Since the elasticity of a function is simply the derivative ^ with 

CLZ 

Z 

a multiplicative factor -, which makes it independent of units, we 

can obtain elasticities from the corresponding derivatives. It is 
interesting, however, to translate the derivative rules into elasticity 
form and to examine the nature of the elasticities of simple functions. 
If u and V are single-valued functions of z, then 

JEu Ev 
E{u±v) Ez Ex 
Ex u±v * 

E(uv)_Eu Ev ,, 

‘ Ex Ex ^ Ex * 


E 


Eu Ev 
Ez ~~ Ex Ex 
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Further, if 1 / is a function of where is a function of a:, then 

Ey Ey Eu 
Ex Eu Ex 

As an example of the method of proving these results, we have 
E(uv) _x d{uv) ^x / du dv\^xdu xdv_Eu Ev 

Ex uv dx uv\ dx dxl udx vdx^Ex Ex 

The other results are established in a similar way. 

It is to be noticed that the elasticity rules are simplest in the 
cases of product and quotient and more complicated for sum and 
difference. This completely reverses the position obtained in the 
case of the derivative rules. But it is no more than we expect since 
an elasticity is a derivative in which the variables are expressed as 
logarithms and since logarithms are designed to deal conveniently 
with products and quotients. Further, since the elasticity of a 
constant is zero, 

E{u^-a)_ u Eu - E{au)_Eu 
Ex ~'u^¥x 

i.e. it is the multiplicative constant that disappears in an elasticity. 

As examples of the elasticities of simple functions, we have : 

Hence it is the power function, and not the linear function, which 
has a constant elasticity. If a is a positive constant, then the 
function with the same positive elasticity a at all points is y—oxi^, 
and the function with the same negative elasticity ( - a) at all points 
is yx^=a. In particular, the fimction y=ax has elasticity 1 at all 
points and the function xy^a has elasticity ( - 1) at all points. In 
diagrammatic terms, the function y=axf^ is represented by an 
upward sloping straight line with gradient a on logarithmic scales 
and the gradient measures the constant elasticity of the function. 
In the same way, the function yx^=a is shown as a downward 
sloping line with numerical gradient a on logarithmic scales. The 
rectangulai® hyperbola a:y=a is a particular case, shown on a log- 
arithmic diagram by a line sloping downward with gradient and 
elasticity numerically equal to unity. 

The elasticity of a function varies, in general, with the value of x 
taken. Certain points may be found where the numerical value of 
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the elasticity is unity and these points are of particular interest. 
E 

If ^{f{x)}=^l at a point x, then a proportional increase in x from 
Ex 

this point gives rise to an equal proportional increase in f{x). On 
E 

the other hand, if 7 V"{/(^)}> 1 l^te point x, then a proportional 
Ex 

increase in x results in a greater proportional increase inf(x). Con- 
E 

versely, if ^{/(a:)}<l at the point x, the proportional increase in 

E 

f{x) is less than that in x. Similar remarks apply when = “ 1 

at a point x, but here proportional increases in x correspond to 
proportional decreases inf(x). 

Further, at a stationary value of a function, the derivative, and so 
the elasticity, is zero. As particular cases of the product and quotient 
rules for elasticities, we have 
E 

Ex\x j' 

It follows that a maximum or minimum value of the “ total ” ex- 
pression {xf(z)} can occur only at a point where the elasticity of 
f(x} is - 1 and that a maximum or minimum value of the “ average ** 

Finally, it is clear, from the definition, that the elasticity of a 
function f{x) is the ratio of the “ marginal value of f(x) to the 
‘‘ average ” value of f(x) at the point in question, i.e. the ratio of 

fix) . 

X 

maximum or minimum and the elasticity is equal to 1, we have also 
that the “ average ” and “ marginal ’’ values of the function are 
equal (see 8.5 above). Points where the elasticity of a function is 
numerically equal to unity are thus of considerable importance. 
This is particularly evident, as we shall show, when we consider the 
application of the elasticity concept to the demand and cost problems 
of ecoqomic theory. 


expression 


can occur only where the elasticity of f{x) is 1. 


/'(x) to 


Hence, at a point where the “ average ” value is a 


10.6 The elasticity of demand. 

The market demand for a good can be represented, under certain 
conditions, by the monotonio decreasing function x = (f>{p). The 
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elasticity of this function defines the elasticity of demand at any 
price. Since its value is negative at all prices, it is found convenient 
to make it positive by the introduction of a negative sign. This step 
does not affect the elasticity concept in any material way. So : 

Elasticity of demand 17= • 

The value of rj, which is independent of both price and quantity 
units, varies from point to point and always measures the rate of 
proportional decrease of demand for proportional increases in price 
from the price and demand in question.* 

When we consider proportional changes in demand and price, it 
is a great advantage to plot the demand curve on a logarithmic 
diagram, taking both price and demand on a logarithmic scale. The 
tangent to the demand curve is then downward sloping with a 
numerical gradient equal to the elasticity of demand. For example, 
the demand law x~ap~^ is shown, on a logarithmic diagram, aa a 
straight line with gradient ( ~ a). The elasticity of demand, in this 
caae, is constant and equal to a at all points. 

If demand curves are drawn on natural scales, as is usual in 
economic works, it becomes more difficult to estimate and compare 
the elasticities at various points on the same or on different demand 
curves. It is tempting to estimate the elasticity from the gradient 
of the demand curve, to say that a demand curve steeply inclined to 
the price axis has a large elasticity. This is incorrect. Demand 

T) I dx\ 

elasticity is the product of ^ and ^ - —j and its value cannot be 

judged solely from the second of these two factors. For example, 
the linear demand curve has a constant gradient but its elasticity is 
not constant. The elasticity, as is easily seen, decreases as the price 
decreases and the demand increases. 

Two diagrammatic methods have been devised by Marshall for the 
purpose of estimating the elasticity of a demand curve drawn on 

* The inverse demand function is p = *p{x) and the elasticity of price with 

respect to demand is ” ” the reciprocal of rj. Some writers use*the term 

flexibility of price to indicate the reciprocal concept (see IMoore, Synthetic 
Economics, 1929, p. 38). Notice also that the elasticity of demand and 
flexibility of price are sometimes written without the negative sign (see 
Schultz, Statistical LawtTof Demand and Supply, 1928). 
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natural scales.* The first method makes use of a simple geometrical 
property of the demand curve. With the notation of Fig. 70, tPT 
is the tangent at any point P on the demand curve and the elasticity 
of demand at P is 

_pl dx\_ MPMT _MT 
'^~x\ df)~ OMMP~ OM' 

MT PT 

A well-known geometrical property of parallel lines gives 

PT ON 

(since MP is parallel to Op) and (since NP is parallel to 

Ox). Hence: ^ 

_MT_ON PT 
m ~ tp ’ 

We have thus three convenient length ratios as measures of the 
demand elasticity at any point on a demand curve. 




The second method depends on the fact that any curve with 
equation is a demand curve with unit elasticity at all points. 

Such a curve can be called a constant outlay curve^ since the outlay 
(xp) of consumers is constant at aU prices. On logarithmic scales, 
the curves for various values of a form a set of parallel straight lines 
sloping downward with unit gradient. On natural scales, we have 
the system of rectangular hyperbolas shown in Fig. 71. Through 
each point P of a given demand curve AB^ there passes, one of the 
constant outlay curves. The elasticity of the demand curve AB at 
P can be compared with the unit elasticity of this constant outlay 
curve by comparison of the gradients of the two curves referred to 

* Marshall, Principles of Economics (8th Ed. 1027), p. 839. 
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the price axis. This is possible since the other factor ^ in the elas- 
ticity expression is the same for both curves at P. If the demand 
curve AB is steeper to the price axis than the constant outlay curve 
at P, then the demand elasticity is greater than unity at this point on 
AB. If AB is less steep than the constant outlay curve at P, then 
the demand elasticity of AB is less than unity. Finally, if AB 
touches the constant outlay curve at P (as shown in Fig. 71), then 
AB has unit elasticity of demand at P. Hence, the path of a given 
demand curve across the constant outlay curves gives us a good idea 
of the variation of demand elasticity along the demand curve. 


10,7 Normal conditions of demand. 

For any demand law p = ili{x), we have R=xp=xtlf(x), and 
dR d , ^ dp 


:=i-J^P)=P+^7i-p{^+l^) 


But the elasticity of demand ij = - and so 

X dp 



This is an important result. From it we deduce : 

(1) If 77 > 1 at a given price and demand, a small decrease in price 
results in a more than proportional increase in demand, marginal 

revenue ^ is positive and total revenue increases as output (=de- 
dx 

mand) increases. This is the case of elastic demand. 

( 2 ) If 7 ^ = 1 at a given price and demand, the small decrease in 
price and increase in demand are proportionally equal, marginal 
revenue is zero and total revenue has a stationary (usually a maxi- 
mum) value. 

(3) If t; <1 at a given price and demand, a small decrease in price 
is accompanied by a less than proportional increase in demand, 
marginal revenue is negative and total revenue decreases as output 
increases. * This is the case of inelastic demand. 

Our general idea of elastic and inelastic demands (see 5.4 above) 
is thus in agreement with the precise measure of demand elasticity. 

We have assumed, in the normal case of demand, that price 
decreases are associated with demand increases, and conversely, i.e. 
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that the demand curve falls from left to right. A further property 
of demand is now assumed to hold in the normal case. It is taken 
that, as the demand increases, the elasticity of demand decreases 
continuously from values greater than unity at small demands to 
values less than unity at large demands. As demand increases it 
becomes continuously more inelastic. It follows that there is one 
definite demand x—a where the demand elasticity is equal to unity. 
This normal property of the demand curve is illustrated by the 
curve AB of Fig. 71. The curve touches a constant outlay curve at 
one definite demand, is steeper to the price axis than the constant 
outlay curves for all smaller demands and less steep than the constant 
outlay curves for all larger demands. 

In the normal case of demand, therefore, total revenue R increases 
with output at first (r;> 1), reaches a definite maximum value at the 



output x=a (t; = 1) and then decreases as output increases further 
{7)<l), Average revenue, of course, decreases continuously as output 

increases. Now, firom the expression for ^ above, 

(1/OC 

marginal revenue = average revenue 

Since r] is positive, marginal revenue is less than average revenue at 
all outputs. Further, as output increases, rj decreases cfiad becomes 

unity at the output a: = a. Hence, for outputs less than a, is 

positive and decreasing. So marginal revenue must decrease as 
output increases until it becomes zero at the output a: =a where = 1. 
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For further increases in output, marginal revenue is negative but 
need not decrease continuously. The normal forms of the total, 
average and marginal revenue curves are indicated in the two 
diagrams of Fig. 72. The diagrams must be consistent in the sense 
that the ordinates of the average and marginal revenue curves are 
respectively equal to the gradient of the vector from 0 and the 
gradient of the tangent at the corresponding point on the total 
revenue curve. 

The linear demand law, 'p=a-bx, and the hyperbolic demand 

law, p=-^^ c, are both of the normal form here described. In 
^ a;4*6 

the first case, “7 ~ ^ second case, 




be ~ cx). 


Each of these expressions decreases continuously as x increases. 
The total, average and marginal revenue curves in particular cases 
of linear and hyperbolic demands are graphed in Figs. 44 and 45 
above. It is clear, from these graphs, that the demand curves are of 
normal form. 

In conclusion, an inter- 
esting connection between 
the average and marginal 
revenue curves can be 
estabhshed.* If the tan- 
gent at any point P on 
the average revenue curve 
cuts the price axis in A 
and if 4 is joined to the 
point Q on the marginal 
revenue curve at the same 
output as P, then the 
gradient of ^P referred to 

the price axis is twice that of AQ. In Fig. 73, PM and QN are per- 
pendicular to the price axis. Then (10.6 above) : 



OM ^ p 


i.e. 3IA=^ , 
V 


* See Robinson. The Ecoitomics of Imperfect Competition (1933), pp. 29 etaeq. 
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where ij is the demand elasticity at P. But marginal revenue equals 
average revenue times i. 

OM 


i.e. 
rj/ 

ON = OM 


and 


So 


NM = OM-ON = — =^ 


MA=NM. 


V 


rru A- 4. C A 4.1. • • • • A • 

The gradient of AP to the price axis is twice 

the gradient of to the price axis. ^ 

This result provides a method of tracing the marginal revenue 
curve from a given demand (or average revenue) curve. Select any 
point P on the demand curve, draw the tangent to cut the price axis 
in A, draw the line AQ with gradient to the price axis half that of 
AP and find the point Q on AQ with the same output as at P. Then 
Q is a point on the marginal revenue curve and the whole marginal 
revenue curve is traced by repeating the process for different points 
on the given demand curve. 


10.8 Cost elasticity and normal cost conditions. 

A less generally recognised, but extremely useful, application of 
the elasticity concept is found in the analysis of the cost problem 
(5.6 above). This application provides a striking contrast to the 
use of demand elasticity. The latter refers to the average quantity 
(average revenue) and enables us to determine, by relating its value 
to unity, whether the corresponding total quantity (total revenue) 
increases or decreases. The position is completely different in the 
cost problem. Here the total quantity (total cost) increases for all 
outputs and its elasticity is defined and used to deduce properties 
of the average quantity (average cost). 

If a firm produces an output a; at a total cost n=F(x), then 

• • 

• m 4* u P4 4 1 4 d(\ogn) 

Elasticity of total cost k=-^- 7 - = ° r • 

n dx a (log x) 

The value of k is independent of both cost and output units and 
measures the proportional rate of increase of total cost for proper- 
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fcional increases in output from the level in question. ♦ From the 
definition k is equal to the ratio of marginal to average cost. Further, 

the elasticity of average cost can be expressed : 

xdiT ^ /n\ I f dn \ a:d/7 

irdx ndx\xl n x'^V dx ) FI dx ^ 

From these results, it follows that : 

(1) If /c<l at a given output, we have the case of increasing 
returns in which a small increase in output is obtained at a less than 
proportional increase in cost, average cost is greater than marginal 
cost and average cost decreases as output increases. 

(2) If #c = 1 at a given output, the case is one of constant returns 
in which the small increase in output is proportionally equal to the 
increase in cost, average cost equals marginal cost and average cost 
has a stationary (usually a minimum) value. 

(3) If K>1 at a given output, we have decreasing returns and the 
situation is exactly the reverse of that when /c < 1. 

It has been assumed, in the normal case of cost conditions, that 
total cost increases continuously from a positive value (representing 
overhead costs) as output increases from zero. A further property 
of the normal case is now added, the property that cost elasticity 
increases continuously from values less than unity at small outputs 
to values greater than unity at large outputs. Returns become 
increasingly unfavourable as output increases. There is one definite 
output a;=a, where cost elasticity equals unity and returns cease to 
be increasing and become decreasing.*!* 

It follows, in the normal case, that average cost falls with increas- 
ing output at first, reaches a minimum value at the output x=a and 
then increases as output increases further. Marginal cost is less 
than average cost for outputs less than a and greater than average 

* The elasticity of total cost is described by Professor Moore (Synthetic 
Economics f p. 77) as the “ coefficient of relative cost of production ’* and its 
reciprocal as ^he • coefficient of relative efficiency of organisation **. The 
reciprocal of #c has also been used by W. E. Johnson (The Pure Theory o] 
Utility Curves, Economic Journal, 1913, p. 608) and by Professor Bowley 
(The Mathematical Oroundwork of Economics, 1924, p. 32). 

t As a slightly less severe normal assumption it can be taken simply that 
IT ~ 1 at some deiSnite output x = a, k< 1 for outputs x < a and #f > 1 for outputs 
x>a. 
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cost for outputs greater than a. At the output a; = a, marginal cost 
is rising and equal to the (minimum) average cost. The normal 
forms of the total, average and marginal cost curves are shown in 
the two diagrams of Fig. 74. The forms of the average and marginal 



cost curves can be checked from the form of the total cost curve, 
and conversely. If P is the point on the total cost curve at a given 
output, then average and marginal cost are to be read off as the 
gradients of OP and of the tangent at P respectively. 

The simplest cost function satisfying the normal cost conditions 
is the quadratic form 

n = ax^ 

where a, b and c are positive constants. Here 

^ x{2aX’\-b) 
ax^-\-bx-\-c 

The derivative of k can be shown to be positive, i.e. k increases as x 
increases. The total, average and marginal cost curves in the case 
of a particular quadratic cost function are graphed in Fig. 46 and 
they are seen to be of normal form. 

It may be convenient to take the total cost curve with a point of 
inflexion and the marginal cost curve with a minimum point at some 
output less than that at which average cost is a mlnimtim. This is 
shown in Fig. 74. The quadratic cost function does not exhibit this 
additional property and the simplest normal cost function to do so 
is the cubic form 


n = ax^ - bx^ + cx-\-d 
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where a, b, c and d are positive constants (6* <3ac). 

Average cost = ax* -bx + c + -, 

2 / 

Marginal cost ^ - 2bx -f c 
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Hero 


and 


dm 

dx^ 


— Gax - 26. 


dn 


Since — is positive for all values of x (since b^<^ac) and since 
= 0 at the single output x = ~ , the total cost curve rises but 

dX set ^ 

has a point of inflexion at the output . The average cost curve 
has a single minimum value at an output greater than x= — (see 

oCL 

8.4, Ex. 7). The marginal cost curve is a parabola with a positive 


minimum value 


3ac - 6^ 
3a 


at the output x = — , The curves are thus 
3a 


of the normal form shown in Fig. 74. 

As a final illustration of the elasticity concept, we can consider a 
problem very similar, in some respects, to the cost problem just 
discussed. The output of a firm, under given technical conditions, 
is determined by the amounts of the various factors of production 
employed. It is now assumed that the factors are always employed 
in the same fixed proportions. The output x of the good pro- 
duced is then uniquely dependent upon the proportion A by 
which the factors are increased (A>1) or decreased (A<1) from a 
given basic position. We can, therefore, take a: as a function of A 
and define 

. T • • Ado; d(loga;) 

Elasticity of productivity c == - ^ = d(lo(^) ’ 


If €>1, we have the case of increasing returns in which a small 
proportionate increase in all factors employed results in a more than 
proportional incresise in output. In the same way, € = 1 and €<l 
correspond to cases of constant and decreasing returns. • In the 
normal case, € decreases continuously as A, and the amount of the 
factors employed, increases. It is to be noticed that the reciprocal 
of € is analogous to K^in that it measures the elasticity of cost with 
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respect to output, cost being expressed in terms of the amount of the 
factors used instead of in money terms. But e is limited, as ic is not, 
to the case of factors used in fixed proportions.* 


EXAMPLES X 

Exponential and logarithmic derivativee 
1. Obtain the derivatives of the functions : 2®, e' 


1 1 


Xt-*, log(l-a:), log (*+!)(*+ 2), log >/!-*», log(l+x*), logj^, log|i^, 

j X “ .17 X X 

and log — • 

Va? 

2. By writing = e®(e*)®, verify that — = a€®*+*. 

8. Find the derivatives of (ax + 6)e* and (ax + 6)e“®. 

4. Show that ^ {i(«* + «“*)*} = «" - «'“• 

5. Show that ^(xloga:) = l + log*, 


dx 


log(x+^/x*-a*)=:'j~. 


1 


s/x^ ~ a* 
0,6, -o,6i 


6. Establish the general result 

d . a,a; + 6j_ 

dx ^a^x + b^ (aja; + 6j)(a,a7 + 6,)* 

7. Find the second derivatives of e“®, \og(ax + 6), x log x and a?* log x, 

8. Show that = (iC + r)e* and = ( - ir(a; ~ r)e“*, 

9. By logarithmic derivation, evaluate the derivatives of 


x^(x- 1)», 


{a7-l-l)(a;+2) 


and X 


Ix-^l 

Va;-l‘ 


(x-~l)(x-2) 

‘ 10. Find the derivatives of e®®*+*®+^ and (ox* + 6x + c) e* by taking logeirithms 
before derivation. 

11, By logarithmic derivation, show that 

^ (a;®) = x® ( 1 -f log x) and ^ (x"e®®+^) = (ax + 

12. Find the maximum and minimum values of x*e*. Show that (log a? - x) 
has only one maximum cmd (x log x) only one minimum value. 

18. Show that the curve i/ = xe~* has one maximimi point' and •one point of 
inflexion. Verify that y and its derivatives tend to vanish as x-^ao . Illus- 
trate from the graph of the curve (Examples IX, 17). 

♦ The concept of elasticity of productivity (Ergiebigkeitsgrad) is used by 
Prof. Schneider (Theorie der Produktiont 1934). It describes one aspect of the 
** production fimction *’ which will be discuss^ at a later stage. 
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14. Show that, on the curve = there are two points of inflexion 
separated by a maximum point. Hence, indicate the shape of the ciirve and 
describe its behaviour as aj-> ± oo (Examples IX, 18). 

16. If !/= J(e*4-e~*), show that y and are always positive while ^ 

changes sign at x — 0. Deduce that the curve representing this fimction 
roughly resembles a parabola with a minimum at a? = 0. How can the curve 
be derived from the curves y = c® and y = e“* T 

16. Ify = Y^j;^ri.ehowthat^ = y(l-y)and^ = y(l-j/)(l-2j/). Deduce 

that y increases continuously between 0 and 1 as x increases. Show that the 
curve representing the function has a point of inflexion at x = 0 and indicate 
its shape. (This is the logistic curve, see 16.2 (3), Ex. 3 below.) 

17. The selling value of a good is £/(<) after t years from the time when the 

fixed cost of production of £a was incurred. If £/(0 represents lOOr per cent, 
(reckoned continuously) on the outlay £a, find r as a function of t. It is 
assumed that the entrepreneur aims at maximising r. Show that the optimum 
storage period t and the (maximised) value of r are given by f(t) — ae^ and 
f'(t) = Illustrate diagrammatically and contrast this problem with that 

of 10.3. 

18. The cost of planting a piece of land with timber is £272. The value of 
the timber after t years is £100elv^<. Show that the present value of the 
timber is greatest (the rate of interest being 6 per cent, compounded continu- 
ously) if it is cut after 26 years. Show also that the maximum return (reckoned 
continuously) on outlay is 6J per cent, after 16 years. Hence illustrate the 
difference between the problem of 10.3 and that of the previous example. 
(Take log, 2-72=1.) 

19. It is assumed, at given prices of consumers’ goods, that the utility of 
an individual consumer is measurable and dependent on his income £x. How 
is the marginal utility of income then defined ? If it is known that marginal 
utility of income decreeises towards zero as income increases, show that 

u = alog- and u = ax^ are two possible forms for the utility function. 


Elasticities and their applications 

20. Show that the inverse function rule is of the same form for elasticities 
as for derivatives. Verify the rule by finding the elasticities of e* and log x 
separately. 

21. Evaluate the eleisticities of xe*, xe~* and 




22. If € is the elasticity of f{x), then the elausticities of xf{x) and 

(€ + 1) aind (e - 1) respectively. Check with /(x) = ox®. ^ 

• * 2P 

23. Find the elasticity of demand rj when the denmnd law is x= ^ and 

p = 3. Plot an accurate graph of the demand curve, draw the tangent at the 
point P where p = 3 and locate the points T and t where it cuts the axes. 
Hence verify that =*= MT ; OM = ON : Nt, where PM amd PN cure perpen- 
diculcurs to the axes. 
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24, If p cents per bushel is the price of com and x bushels the yearly con- 
sumption per head of com in the U.S.A., it is found that 


p 

60 

66 

60 

66 

70 

X 

30-4 

28-4 

26-7 

26-3 

24-0 


By plotting on a logarithmic diagram, show that the elasticity of demand 
for com can be taken as approximately constant for all prices. 

26. The following table gives the average N.Y. price ( p cents per lb.) and 
the yearly consumption (x th. short tons) of sugar, U.S.A., 1904-6 : 


Year 

P 

X 

1904 

4-8 

3100 

1905 

6-25 

2950 

1906 

4-6 

3210 


Plot on a logarithmic diagram and draw three parallel lines, one through each 
plotted point, so that the second line is equidistant from the other two. It is 
assimiod that the elasticity of demand for sugar is constant and that the 
logarithmic demand line has shifted downwards by equal distances over the 
three years. Estimate the elasticity of demand for sugar. (See Pigou, The 
Staliaticdl Derivation of Demand Curves, Economic Journal, 1930.) 

26. If A.R. and M.R. denote the average and marginal revenue at any 

A.R. 

output, show that V — output. Verify for the linear 

demand law p — a-bx, 

27. Find the elasticity of demand in terms of x for each of the demand 
laws p = ^a - bx, p = (a- hxY and p — a- bx*. Show that rj decreases as x 
increases and find where rj equals unity in each catse. 

28. If the demand law is x = a€~^^, express demand elasticity and total, 
average and marginal revenue as functions of x. Show that the demand is 
of normal form. At what output is total revenue a maximum ? 

29. A monopolist radio manufacturer produces lOOx sets per week at a total 

cost of £(^a;* +3a; + 100). Tlio demand is x=e getg per week when the 
price is £p per set. Plot the marginal cost and the marginal revenue curves 
on the same graph for outputs up to 50 sets per week and deduce that 
approximately 26 sets per week should be produced for maximum monopoly 
profit. (Take log, 10 = 2-3020.) 


30. If the demand law is x~p^e~^^^\ show that the demand increases as 

the price decreases, becoming large as the price approaches the^yalue ? . Find 

, a ^ 

the elasticity of demand for 8uiy price greater than t - Is the demand of 

noimal form? ® 


31. If a firm produces an output a; at a total cost of Tl—ax* + bx, find an 
expression for k, the elasticity of total cost. Show that k is always greater 
than unity and increases as x increases. 
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32. Show that, for the total cost function 77 = 'Jax + b, the elasticity of total 
cost increases but remains less than unity as x increases. 

X b 

33. The total cost function of a firm is 11— ax \-d. Show that the 

x + c 

average and marginal cost curves are of normal form if a, 6, c and d axe 
positive and if d< a(c - 6), the former having a minimum point and the latter 
rising continuously. 


X b 

34. If Il — ax^ + d is the total cost function, where a, 6, c and d are 

x + c 

positive (b<c), express average and marginal cost as functions of x. Show 
that ^ increases with x, being negative for small x and positive for large 
X. Show, also, that increases from zero to large values as x increases. 
Deduce that the cost curves are of normal form (see Examples V, 22). 


35. The production (y) of a good Y depends on the amoimt (x) of a single 
variable factor of production X according to the law y = ax'^(h~x), where a 
and b are positive constants. Express average and marginal product as 
functions of x and indicate that the forms of the corresponding curves can be 
taken as typical of the normal case of production (see Examples VII, 37). 


36. The supply of a good is given by the law x = a^p - 6, whore p (which is 
greater than b) is the price and a and b are positive constants. Find an 
expression for e, the elasticity of supply as a function of price. Show that e 
decreases as price and supply increase and becoiries imity at the price 26. 
What property of the supply curve holds at the point where e = 1 ? 



CHAPTER XI 


FUNCTIONS OF TWO OR MORE VARIABLES 
11.1 Functions of two variables. 

OxTR study of relations between variable numbers has been limited, 
so far, to the case of two variables, one of which is taken as a function 
of the other. This restriction can now be relaxed by defining 
relations involving many variables and by applying such relations 
in the interpretation of scientific phenomena. It is evident that all 
sciences are concerned with large numbers of inter-related variable 
quantities and that only by a process of severe simplification can 
functional relations between two variables be applied at all. For 
example, the volume of a gas depends on the pressure, on the tem- 
perature and on other factors. In economics, v4^hen an individual 
considers his purchases on a market, the demand for any good 
depends, not only on the price of the good, but also on his money 
income and on the prices of related goods. Again, to quote an every- 
day example, the sum of money extracted by a London taxi-driver 
from his passenger depends on the distance travelled, on the time 
taken for the journey and on the proportion expected by way 
of a tip. 

The formal extension of the function concept presents no dijfficulty . 
Suppose, first, that we have three variables, x, y and z, each with its 
appropriate range of variation. The variables are related by an 
implicit function if the values taken by them are not arbitrary but 
connected in some definite way. In general, we write 

/(*, y, 2)=0. 

Further, the variable z is an explicit function of the Varittbles x and y 
if the values of z depend in a definite way on those allotted arbitrarily 
to X and y. In this case, we write 

*=/(». y)- 

A given implicit function between three variables gives rise to three 
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distinct explicit functions ; each variable can be taken as a function 
of the other two. For, when values are allotted arbitrarily to two 
of the variables, the functional relation determmes the values of the 
third variable.^ When one particular explicit function is considered, 
say z as a function of x and y, the variable z is called the dependent 
and the variables x and y the independent variables. ^ 

The terms already used (2.4 above) to distinguish different kinds 
of functions extend at once. It is important to distinguish, in 
particular, betwe gi single-valued functions, to which the notion 
z pa^icularly applies, an d multi -valued functions. Of single- 

valued functions, we can note the class ^ifionotoni^fui^ ions where 
the dependent variable increases (or decreases) steadily as the 
independent variables increase. Again, the function z =/(a:, y) is said 
to be symmetrical in the variables x and y if an interchange of these 
Variables leaves the function unaltered in form. Finally, the con- 
cepts of limits and continuity extend easily to the case of a single- 
valued function of two variables. If, for example, the function 
z=f{x, y) approaches the limiting value A as a: approaches the 
value a at the same time that y approaches the value 6, we write 
/(x, t/)->A as x~>a and y->b. 

The function is continuous at the point x=a, y = b if z takes the 
definite value /(a, b) at the point and if z tends to the same value as 
X and y approach a and 6 respectively from either side. All ordinary 
functions are continuous except, perhaps, at a few isolated points 
where the variables assume infinite values. 

The following examples show some common cases of functions of 
two variables and their grouping into function types : 

Ex. 1. The implicit function 2x + 3y~z-l=0i8a particular case of 
the general linear functional relation 

ax-i-by + cz-^d = 0. 

Each of the explicit functions obtained here is single-valued. For 
example, from the particular fimction quoted, we have 

z = 2x + 3^-l; y = i{l~2x + z) and x=- J(1 --3y + 2 ). 

Ex. 2. The three explicit functions 

z=^3xy; z=x*-^y^ and - 2xy -y^ 

are instances of the explicit and “ homogeneous ” quadratic type 

2=oa:*+2Aa:y + 6y*. 
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In the first of these instances, the other two explicit functions are also 
single-valued. In the other two instances, x and y are double-valued 
functions. For example, in the third case, we have 

x^y±j2y^-{-z and y ^ -x ±j2x'^ - z, 
x^ 

Ex. 3. The explicit function appears in implicit form as 

a;* - xz + 2/2 - z = 0, 

and this is an instance of the implicit quadratic function type. 

Here, y is also a single-valued function of x and z but a; is a double- 
valued function of y and z. Another example of the same type is 

a;*-i-y*-i-z* = 16, 

where each variable is a double- valued function of the other two. The 
general implicit quadratic function type can be written 

oa:* + hy^ -l- cz^ + 2/yz -f- 2gxz + 2hxy -f 2itx 2vy + 2wz + d = 0, 
where all the coefficients are constants. 

Ex. 4. The explicit functions z = e**“*'*, z = log (a:* + y®) and z=xV 
illustrate the fact that exponential and logarithmic expressions can 
appear in functions of two variables. In the second case, for example, 
the expression (x^ + y^) takes a definite value when values are allotted to 
X and y and the logarithm of this value then defines z. 

11.2 Diagrammatic representation of functions of two variables. 

If three axes Oxyz are fixed to intersect at right angles at the 
origin O in space and if a definite scale of distance measurement is 
selected, then (from 1.9 above) a point P in space can be located 
by three co-ordinates (x, y, z). It will be assumed throughout the 
following development that Oz is drawn vertically upwards and that 
Ox and Oy are drawn, in a horizontal plane, in the W.E. and S.N. 
directions respectively. The co-ordinate z thus represents “ heights 
above or below the horizontal plane Oxy while the co-ordinates x 
and y represent distances E. and N. of the origin 0 in the plane Oxy, 
These conventions are adopted solely for ease in description ; they 
are not essential to the argument. ^ 

From a given single- valued function z=/(x, y), a table of values 
can be constructed by giving x and y arbitrarily selected values and 
by finding the corresponding values of z. To each pair of values of 
X and y there corresponds a point Q in the plane Oxy (OM =x and 
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ON = t/). A perpendicular QP is then erected of length equal to the 
value of z from the table (Fig. 75). The point P {x, t/, z) is thus 
obtained in space with co-ordinates related by the given function. 
The whole table provides a finite “ cluster of such points in space 
and, if the variables x and y are continuous, there is no limit to the 
number of points that can be obtained. The infinite cluster of points 
makes up what is called a surface in space. The surface has the 
property that it is cut by lines parallel to Oz in no more than a single 
point the height of which above or below Oxy represents the value 
of the function z =/ (x, y) for the values of x and y concerned. An 
easily visualised example of such a surface is provided by an open 
hemispherical bowl resting with its rim in a horizontal plane. 



More generally, if /(x, y, z) =0 is a given imphcit function defining 
z as a multi-valued function of continuous variables x and y, we can 
still represent it by a surface drawn in space. The heights of 
various points on the surface above Oxy represent values of z 
obtained in the function from given values of x and y but, since the 
function is multi-valued, the surface can be cut in more than a single 
point by a parallel to Oz. A sphere can be taken as a simple example 
of the kind of surface obtained when z is a double-valued function 
of X and y. • • 

To a functional relation between three variables x, y and z, there- 
fore, there corresponds a surface referred to axes Oxyz in space. 
Conversely, if axes and a scale of distance measurement are fixed, 
to any surface in space there corresponds a defimte functional 
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relation between three variables, the equation of the surface. The 
concept of a surface, it should be noticed, is not limited to the case 
of a surface without “ jumps ” or ‘‘ gaps ” and with an equation 
which is a continuous function of the variables. 

We can construct an actual three-dimensional model of any surface 
of which the equation is given (e.g. the surface z^x^ + y^). Axes 
Oxy are drawn in a plane board and a set of points is selected and 
marked to correspond to a set of values of x and y and to cover the 
plane fairly uniformly. At each point, a vertical pin is erected with 
height equal to the value of z given by the function. As more and 
more pins are erected, their heads are seen to describe a deiOnite 
surface, the surface representing the function. Such a model is the 
three-dimensional equivalent of the graph of a function of one 
variable but, on account of the labour involved, its construction is 
rarely a practical proposition. 

Again, if a function type is defined with the aid of certain para- 
meters (e.g. the type 2 =ax*-f then there corresponds a 

whole dass of sjirf^^ in space. Each surface of the class corre- 
sponds to the function obtained by giving definite values to the 
parameters. The systematic study of function types and surface 
classes is simply an extension of analytical geometry from two to 
three dimensions. For example, it is found that the class of planes 
in space is represented by the linear function type and that a 
class of surfaces known as paraboloids corresponds to the function 
type z=aa;*-h6y*. It is not proposed, however, to elaborate this 
interesting study here. 

11.3 Plane sections of a surface. 

Without constructing an actual model of the surface in three 
dimensions, it is difficult to visualise the form of a surface corre- 
sponding to a given function. This difficulty can be overcome, to a 
large extent, by a method of great practical and theoretical service, 
the method of “ plane sections The essentials of the method 
are contained in certain expedients familiar in cver 3 ’day life. An 
architect, for example, seldom constructs a “ working model of a 
house he has designed ; he draws ground and floor plans (horizontal 
sections) and cross-sections or elevations (vertical sections). Again, 
motorists and hikers ” do not carry relief models of Devonshire to 
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guide them about that county ; they refer to Ordnance maps which 
reduc/e the country to plane representations by means of a system of 
contours or horizontal sections. The methods underlying these and 
other practical devices are not difficult to generalise. 

A given plane cuts a surface in points lying on a curve called a 
plane section of the surface. Of all the possible sections of the surface, 
the most convenient are the horizontal sections by planes perpen- 
dicular to Oz and the vertical sections by planes perpendicular to 
Oxy, Each of these sets of sections can be considered in turn. 

A horizontal section of the surface z =f{x, y)ia a, curve lying in the 
surface at a given and uniform height above the Oxy plane. Taking 



different horizontal sections at various heights and projecting them 
on to the Oxy plane (by reducing the heights to zero), we obtain a 
set of curves in Oxy which can be called the contours of the surface. 
Each contour, to which can be attached the height (or value of z) 
which defines it, shows the variation of x and y for the given value 
of z. If the variables are continuous, the set of contours consists of 
a limitless number of curves all included in the general equation 

/(^, t/)= constant 

by allotting different values to the constant. In Fig. 76 are shown 
certain contcturs 6f the “paraboloid” surface with equation z == 0 ;® -f y®, 
at heights z = l, 2, 3 and 4. The contours are here concentric 
circles with centre at the origin. 

The shape of the surface, i.e. the variation of z with x and y, is 
shown in a very convenient way by the contour system. As the 
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independent variables change, a point (x, y) moves in the plane Oxy 
and the variation of z (the height of the surface) is seen by tracing 
how the point moves in relation to the contours. The value of z 
increases, remains fixed or decreases according as the point crosses 
from lower to higher contours, remains on one contour or crosses from 
higher to lower contours. Further, the values of z attached to the 
contours crossed indicate the actual changes in z. If, as is usual, 
the contours are for values of z at constant intervals, then z changes 
more rapidly and the surface is steeper when the contours are closer 
together. A curve across the contour system (e.g. the fine AB of 
Fig. 76) implies a definite change in x and y and the contours show 
how the corresponding value of z changes. The curve, in fact, is the 
projection on to Oxy of an actual path on the surface. The markings 
of roads, railways and other features of a country on a contoured 
Ordnance map are made on this principle. 

Vertical sections of the surface can be defined in a similar way. 

Fixing a value of t/, we have 
a section of the surface by a 
plane perpendicular to Oy 
showing the variation of z 
as X changes only. Such a 
section, which is conveniently 
projected on to the plane 
Oxz, is an “ elevation of 
the surface showing the up 
and down movement of a 
path on the surface in the 
W.E. direction. The section 
77 ^ can be traced, at least rough- 

ly, from the contour map 
already obtained. The curve of Fig. 77 is the elevation of the 
surface z=^x^-{-y^ for t/ = l, i.e. the parabola 2 = 1 It can be 
traced by observing how the line AB (y = l) crosses the contours of 
Fig. 76. By taking different planes perpendicular to Oy, a whole 
set of Vertical sections can be obtained in the plane Oxz, all sections 
being included in the general equation 

b) 

for different values of the parameter 6. Other systems of vertical 
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Bections can be derived by taking a set of planes perpendicular to 
Ox (on each of which x has a constant value) or by taking a set of 
vertical planes through Oz and at various angles to Ox (on each of 
which X and y vary in proportion). 

A system of sections by planes perpendicular to one or other of 
the axes shows, therefore, the related variation of two of the variables 
of the function for a given value of the third variable. Further, by 
arrangmg the sections of the system in order and relating them, we 
can even allow for the variation of the third variable. This idea of 
holding one variable constant while observing the variation of the 
other two is at least a first step in the examination of the complex 
variations of the complete function ; it Ues behind most of the 
analytical and diagrammatic expedients wliich are described below. 

11.4 Fimctions of more than two variables. 

There is httle difficulty in completing the extension of the function 
concept and notation to the general case where any number of 
variables is taken. An implicit function relating four variables 
x^ y, z and u is denoted by f(Xyy, z,u) = 0 and implies that one vari- 
able is a (single-valued or multi-valued) function of the other tluree. 
Further, to denote u as an explicit function of three independent 
variables x, y and z, we write, in general, u =f(Xy y, z). This notation 
applies particularly, but not exclusively, to the case where the 
function is single-valued. 

More generally, a functional relation between n variables 

/(^l, ^2, ••• ^n)=0 

gives one variable as a function of the other (n - 1) variables. Or, 
if a variable y is given as an expUcit function of n independent 
variables x^, Xg, x^, ... a;„, then we write 

y—f(pO\, X2, x^, ... 

The different variables related, or the different independent variables, 
are here conveniently represented by a single letter x with different 
suffixes. It is easy to see, in this notation, the number of the variables 
concerned. The notation should not be confused, however, with the 
practice we have sometimes adopted of adding suffixes to indicate 
particular values of one variable. 

Pure geometry as an abstract study is just as possible in four or 
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more dimensions as in two or three dimensions, and the correspond- 
ence between the functions of analysis and the concepts of geometry 
persists no matter how many variables are related. But actual 
dM|p:ammatic or graphical representations are no longer possible 
\imen more than three variables are involved. We should need to 
fix more than three axes intersecting at right angles and, unfor- 
tunately, the resources of concrete space are limited to three axes or 
dimensions. In the present development, therefore, we consider 
first the analysis of three-variable relations with concrete diagram- 
matic representations used as illustrations. We can then extend the 
results in a purely formal and analytical manner to the general case 
where many variables are related. Diagrammatic illustration finds 
no place in this extension, nor is it needed. Analysis, as we shall see, 
finds it little more difficult to deal with the general case than with 
the three-variable case. 

It is sometimes convenient, however, to use certain geometrical 
terms even in the formal analysis of functions of many variables. 
The use of the terms is merely descriptive and made by analogy with 
the three-variable case. So, a set of values of n variables 

(Xly X^y X^y ... X^ 

is called a ** point ” in n-dimensional space referred to n mutually 
perpendicular axes, and any relation between the variables is 
described as a “ hyper-surface ”, a “ locus ” of a “ point ” in n dimen- 
sions. In particular, the ‘‘ hyper-surface ” representing a finear 
relation between the variables is termed a ” flat ”, a concept which 
includes a straight line (two dimensions) and a plane (three dimen- 
sions) as particular and concrete cases. 

11.5 Non-measurable variables. 

It was seen (1.5 above) that many properties of scientific pheno- 
mena can be described by the measurable quantities with which our 
analysis has been concerned. There remain other properties capable 
of order without the additive character necessary fot direct measure- 
ment and described by ordered but non-measurable magnitudes. 
We have yet to investigate whether, and under what conditions, 
mathematical analysis can be appUed in such cases. 

A clue to the solution of our problem is to be found in the use of a 
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contour map of a function. As x and y vary continuously, a fimction 
z =/(a;, y) increases, remains constant or decreases according to the 
way in which the variable point (x, y) crosses the contour map of the 
function. There is no reference here to actual numerical changes ij|d[ 
only the order of the values of z and of the contours is needed. ^jjjjP 
possible to proceed to trace actual numerical changes in z by examin- 
ing how close together the contours are as we cross them. But this 
second and quantitative use of the contour map is quite independent 
of the first use. It should be possible, therefore, to take a non- 
measurable magnitude as dependent on measurable variables and 
represent it by a contour map used only in the first of the ways 
described. This is the point we must now develop. 

Suppose that the level assumed by a non-measurable magnitude 
depends uniquely on two measurable quantities x and y. As x and y 
vary, therefore, we can observe simply whether the magnitude 
increases, decreases or remains at the same level. A set of numbers z 
is now associated with the ordered set of levels of the magnitude so 
that z and the magnitude increase, decrease or remain constant 
together. Then z must be some function of x and j/, z=f{Xy y), a 
function which is, however, by no means unique. The magnitude 
can be represented just as well by any other variable number or 
function which changes always in the same direction as z=/(a;, y) 
when X and y vary. For example, the variation of the magnitude 
might be indicated by {f(x, y)y, by or by log/(x, y). In 

general, the dependence of the magnitude on the variables x and y 
can be expressed : 

l = F{z) wnere z =/(x, y), 
i.e. ^ = F{f(x,y)Y 

Here, z=f{x, y) denotes any one variable number which indicates the 
variation of the magnitude and { then denotes any other variable 
number y or the whole class of possible variable numbers, indicating 
the magnitude’s variation. F (z) is any function restricted only by 
the fact that F'(c) must be positive, so that { and z move always in 
the same direction. 

In diagrammatic terms, the variable magnitude is shown by the 
height above Oxy of any one of a large number of surfaces. The 
points on the different surfaces for a given pair of values of x and 
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y are at all kinds of heights, but they move up and down together as 
X and y vary. In particular, if x and y vary so that the height of one 
® surface is imaltered, then the same is true of all surfaces. The 

• rent surfaces have one feature in common ; they all have 
l^ally the same contour map. The values of ^ attached to the 
contours of the map change according to the choice of the function F, 
but this affect^ in no way the form and order of the curves of the 
map. Since is constant if z is, the equation of the contour 

map can be written, in a form quite independent of F, as 

f(Xy y) = constant. 

For example, the same non-measurable magnitude can be shown by 
z=:x^-\-y^f z = (x^-^y^)^, 2 = 6 **+*'* or z — \og{x^ y"^) 

and the contour map of each function is the same system of circles 

rr® 4- = constant 

shown in Fig. 76. The value of z attached to the various circles 
changes from one function to another but that is all. 

The dependence of a given non-measurable magnitude on two 
variable quantities x and y is described by the function F{f{x, y)}, 
wh^re/(a:, y) is any one form of the dependence and where F is any 
monotonic increasing function. The dependence can be represented, 
without ambiguity, by a system of curves in the plane Oxy defined 
by the relation f{x, y) = constant. As x and y vary, the path of the 
point {Xy y) across the curve system tells us whether the magnitude 
increases, decreases or remains constant, and the curve system must 
be used only in this non-quantitative sense. It follows that mathe- 
matical analysis can be applied, subject to the hmitations indicated, 
in problems concerned with non-measurable magnitudes. The 
development given here can be extended, of course, to cases where 
the magnitude depends on more than two measurable quantities. 

11.6 Systems of equations. 

It has been shown (2.9 above) that two equations provide, in 
general, a determinate solution for two variables. This result can 
now be taken up and generalised. If one relation 

/i(*. y)=^ 

is given between two variables x and y, there is an indefinitely large 
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number of pairs of values of x and y possible and the solution of the 
equation for x and y is said to be “ indeterminate If two relations 

/i(^, 2/)=0 and /a(a:, t/)=:0 _ ^ 

are given, there is, in general, only a finite number of possibl^jj^n 
of values of x and y and the solution of the equations is then ‘ -''IIHer- 
minate The solution may be unique (as in the case of two linear 
equations) or it may be multiple (as in the case a linear and a 
quadratic equation). If we are given three relations 

fi{x,y)^0, fi{x,y) = 0 and f»(x,y) = 0 


then the first two equations give a determinate set of solutions for 
x and y and these do not, in general, also satisfy the third equation. 
There is, in fact, no solution consistent with the three equations. 
The same result holds, a fortiori ^ if more than three relations between 
the variables are given. In diagrammatic terms, one equation is 
shown by a curve in the plane Oxy and the co-ordinates of any one of 
the infinite number of points on the curve satisfy the equation. Two 
equations are represented by a pair of curves which intersect, in 
general, in only a finite and determinate number of points. The 
co-ordinates of any point of intersection provide a solution of the 
equations. Three equations are represented by three curves and, 
since there is in general no single point lying on all curves, the 
equations have no solution. 

Very similar results hold of a system of equations in three variables. 
If the number of equations is less than three, an infinite number of 
sets of values of the variables is possible and the solution of the 
equations is indeterminate. If there are three equations, the number 
of sets of values of the variables is finite (unique or multiple) and the 
solution of the equations is determinate. If there are more than 
three equations, then the solutions of any three of them do not, in 
general, satisfy the others and the equations have no consistent 
solution. These solutions are supported by considering the three 
dimensional surfaces which represent the equations. 

A generqj result is now evident. A system of equations is given 
between n variables. Then : 


(1) If there are less than n equations, the system is indeterminate 
in the sense that an infinite number of sets of values of the variables 


is possible. 
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(2) If there are exactly n equations, the system is determinate in 
the sense that only a finite number of sets of values of the variables 
Is possible, the solutions of the equations. 

^H|jf there are more than n equations, there is no one set of values 
oUBlpf^ariables consistent with all equations. 

One restriction on the generality of these results must be noticed. 
It is essential that the equations of the system should be mutually 
consistent with each other and independent of each other. In fact, 
it is not permissible to include an equation in the system either 
if it cannot hold if one of the others does or if it automatically holds 
if the others do. For example, the equation 2a: -f 2i/ - 4z = 6 cannot 
be included in the same system as a: + y - 2z = 3 ; they are not 
consistent. Nor can the equation 2a: -f 2y - 4z = 6 be included with 
a; 4- y - 2z = 3 ; they are not independent equations and the one tells 
us no more about the variables than the other. Hence, before the 
solution of a system of equations can be examined, we must see 
whether the system is mutually consistent and independent. It 
often happens, for example, that the number of equations in a given 
system is one more than the number of variables, so that it would 
appear that no solution exists. On examination, however, it is found 
that one of the equations is derivable from (i.e. not independent of) 
the others. When this equation is discarded from the system, the 
number of equations equals the number of variables and the system 
is determinate.* 

The solution of a system of equations can also be approached from 
a different angle. One equation between n variables determines one 
variable as a function (not necessarily single-valued) of the other 
(n - 1) ; two equations determine two variables as functions of the 
other (n - 2) ; and so on. In general, if there are m equations in 
n variables (m<n), any m of the variables can be determined as 
functions of the other (n-m). Then, if further equations between 
the variables are found, the m selected variables can be eliminated 
by the substitution of the functions already obtained, and the new 
forms of the equations involve only (n ~ m) variables. Thus each 

♦ This case often arises in the analysis of economic equilibrium. See, for 
example, Pareto, Manuel d'kconomie politique (2nd Ed. 1927), pp. 591-3 and 
pp. 010-5; Bowley, The Mathematical Groundwork of Economics (1924), 
pp. 20-2 and p. 51. 
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equation of a system can be used to determine and eliminate one of 
the variables in terms of the others. As long as the number of 
equations is less than the number of variables, there remain some 
variables which cannot be determined and eliminated and to whi<ii 
values can be allotted at will. The equations are indeterminate^ 
But when the number of equations and variables is equal, we can 
eliminate the variables one by one until a single equation in one 
variable is left to complete a determinate solution. For example, 

+ + 2 + 1=0 and 2a;-y ~ 22 : + w + l=0 

are two equations from which we find 

z = \{2+X‘-2y) and u = J(1 - 4a: -y). 

If, now, a third equation is given 

+ + + = 

we have on substitution an equation in the variables x and y only : 

\Zx^ 2xy ^-ly^ - 2x - by - 15 = 0. 

From the system of three equations, two have been used to eliminate 
two variables and the other gives an equation in the remaining 
variables. 

• 

11.7 Fimctions of several variables in economic theory. 

It is clear that functions of many variables must play an important 
part in any precise and general interpretation of economic pheno- 
mena. This fact has been implicitly recognised already when we 
found that fimctions of one variable could only be applied when the 
problems were simplified and abstraction made of the inter-relations 
of the many variable quantities involved. We are now in a position 
to reconsider the problems in a more general light. 

To generalise the problem of demand, we suppose that n consumers’ 
goods Xj, Xj, ... are sold at uniform prices pi, pa> Vz> ••• 
on a competitive market consisting of a fixed number of consumers 
with given tastes and incomes. Then the amount x^ of any one good 
X^ demanded by the market is uniquely dependent on the prices of 
all the goods on the market. We can thus write 

*r = ^r{j>l. —Pn) 

as the demand function for the good X^, a function which, for con- 
venience, can be assumed continuous in aU the variables. 
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The number of the variables overcrowds our picture of market 
demand. It is possible, however, to select a few of the prices accord- 
ing to the particular aspect of the problem considered and to assume 
that all the other prices are fixed. In particular, we can study the 
inter-relations of the demands for two goods and X, by assuming 
that the prices of all other goods are fixed. Then 
= <f>i iPiy P2) and rra = ^2 p^) 

are the demand functions, each dependent on the two variable 
prices. Each function can be shown as a demand surface with heights 
above the horizontal plane OpiP2 representing the varying demand 
for the good. The vertical sections of such a surface are particularly 
interesting. The section of the surface {p^, P2) by any plane 

perpendicular to Op^ (on which p^ has a fixed value) is an ordinary 
demand curve showing the variation of as p^ varies. There is one 
such demand curve for each fixed price p^ of the other good and the 
whole system of demand curves shows the way in which demand 
shifts as the price p^ is changed. AU these demand curves are down- 
ward sloping in the normal case. The section of the surface by a 
plane perpendicular to Opi is a curve showing the variation of x^ as 

varies for a given value of p^. At any point where this section is 
upward sloping, a rise in the price of results in a rise in the demand 
for and the goods can be called “ competitive at least in a rough 
sense. If the section is downward sloping, the converse holds and 
the goods can be called “ complementary ’’ at the prices concerned. 

It is often convenient to assume, as an approximation for certain 
ranges of the prices, that the demand functions are of definite types, 

. The following are examples of demand functions of relatively simple 
type in the case of two related goods : 

( 1 ) = Oj — Oui Pi -f Pa, ^21 2^1 ~ ^22 Pa* 

In tlfe normal case of demand, the constants 0^ and ajj, as well as 
Oj and Oj, can be taken as positive. But the constants and can 
be of either sign. The goods are “ competitive ” if they are both 
positive and “ complementary ” if they are both negative. It can 
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be noticed that the demand functions (3) and (4) become hnear if 
the demands and either both or one of the prices are taken on 
logarithmic scales. 

A slightly different presentation can be given to show the demand 
of an individual consumer instead of that of the whole market. If a 
consumer has given tastes, his demand for a good is a function 
of his income ja and of all the prices : 

Xr = Ply p2i p^y ••• Pn)* 

As a problem of particular interest, we can trace the effect on the 
demand for X^. of changes in income and in the price of X^. JI all 
other prices are taken as fixed, we have a demand surface 

^r = <f>rilJ; Pr), 

showing the demands for X, for various incomes and prices. The 
section of the surface by a plane perpendicular to 0/x is an ordinary 
demand curve for a fixed income level. As the income level is 
changed, the section shifts in position. We can thus observe the 
shifting of demand consequent upon the variation of individual 
income. On the other hand, if tlie price of the good is fixed, the 
variation of demand as income increases is shown by the section of 
the surface by a plane perpendicular to the axis Opr- 

A generalised cost problem provides a second example of the use 
of functions of several variables. If a firm produces different goods 
or different qualities of the same kind of good, and if the technique 
of production and the supply conditions of the factors of production 
are given, then definite amounts of the various goods can be obtained 
at a determinate minimal total cost : 

n = F{X^, X^, Xg, ... xj, 

where Xj, Xg, Xg, ... x„ are the amounts of the goods Xg, Xg, ... X„ 
produced. If only two goods X and Y are produced, the cost 
function n = F{x, y) can be represented by a cost surface with heights 
above the horizontal plane Oxy showing the costs of producing 
various combinations of outputs. The vertical sections of the 
surface reprqsent^the cost of producing a variable output of one good 
in conjunction with a fixed output of the other good. A particular 
form of the joint cost function, such as 

i7=ax2 4- 2hxy + by^ -f 2gx + 2fy + c, 
can be assumed for convenience in certain cases. 
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The transformation functions of 5.6 above can also be generalised 
in an obvious way. The technique of production and the pro- 
ductive resources of a given firm or country are fixed. Then, if 
Xif Xj, ... Xn are the amounts of various goods X^, X,, Xj, ... X„, 
that can be produced in a given year, or the incomes obtainable 
in a series of years, there must be some relation between these 
variables : 

F{xi, a;,, rrs, ... x^) = 0. 

For, given the outputs desired of all goods but one, or the incomes 
desired in all years but one, then technical considerations determine 
the amount of the remaining output or income. When there are 
three goods or incomes, we have a transformation surface 

Vy 2)=0 

referred to axes OXy Oy and Oz along which amoimts of the goods or 
incomes are measured. In the normal case, we can take the surface 
as downward sloping and convex to the origin at aU points. A simple 
case of the normal type of transformation function is 

where a, 6, c and d are positive constants. 

11.8 The production fimction and constant product curves. 

An important problem, to which only incidental reference has so 
far been made,* concerns the conditions under which factors of 
production are combined in the production of a given good by a firm 
or industry. With given technical conditions of production, the 
amount of a good X produced depends uniquely on the amounts of 
the variable productive factors, A^y A^y A^y ... A^ used. If a; is the 
amount produced when amoxmts o^, Uj, a^y ... of the factors are 
employed, we can write the production function 

x=f{ai, a„ a„ ... oj. 

It will be assumed, throughout the present development, that the 
factors of production are continuously divisible and that the pro- 
ductive process is continuously variable. The production function 
is then a continuous function of continuous variables. It is to be 
noticed that our presentation of the problem does not exclude the 


* See 10.8 above. 
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possibility of the variable factors being used with certain fixed factors 
of production. A definite production function can be assumed in 
all cases, no matter how many factors are fixed and how many 
variable. Only the appropriate form of the function depends on 
considerations relating to the nature, the variability or otherwise, 
of the factors. 

If there are two variable factors A and B, the production function 
X =/(a, b) can be represented by a production surface referred to axes 
of which Oa and Ob are taken horizontally and Ox vertically. The 
method of plane sections is now of great advantage. The contours 
of the production surface consist of a system of curves in the plane 
Oab which can be termed constant prodmt curves and are defined by 

/ (a, b) = constant. 

One curve of the system, corresponding to a given value of the 
constant, includes all points (a, 6) representing amoimts of the 
factors giving a definite product x^. The curves form a continuous 
and non-intersecting system covering the positive quadrant of the 
plane Oab in such a way that one, and only one, curve of the system 
passes through each point. As the amount of the factors used are 
changed in any way, the corresponding point (a, b) moves in. the 
plane Oab and its path across the constant product curves determines 
the resulting variation of the product obtainable. 

The vertical section of the production surface by a plane perpen- 
dicular to Ob (on which the value of 6 is fixed) is a curve showing the 
variation of product as various amounts of the factor A are used with 
a given amoimt of the factor B. Thus 

x=f{a, bi) = <f>i{a), x=f{a,bt) = ^i{a), ... 
are the equations of the vertical sections for fixed amounts 6i, 6a ... 
of the factor B, They are different “ elevations ” of the production 
surface. A similar set of sections or “ elevations ” is obtained by 
fixing the value of a and taking planes perpendicular to Oa. A third 
set of vertical sections, corresponding to “ elevations ’’ of the surface 
of a differeiat kind, can also be used to advantage. The sections are 
obtained by taking planes through Ox and at various angles to Oa 
and 06, and they show curves giving the variation of product as the 
factors are varied in proportion. Thus, if a fixed point on one of the 
sections corresponds to amounts Oj and 6i of the factors, then any 
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other point on the section is given by a = Aoj and 6 = A6i, where A is a 
variable proportion. The equation of the section is 

a;=/(Aai, Xbi) = <f>{\), 

a function of the variable A (see 10.8 above). Different sections of 
the complete set are obtained by allotting different values to the 
constants Oj and b^. 

In the “ normal ** case of production, the production surface can 
be subjected to certain restrictions apart from that of continuity. 
It can be assumed, in the simplest ‘‘ normal ’’ case, that product can 
only be maintained, when less of the factor B is used, by increasing 
the use of the factor A. Further, as the substitution of the factor 


3 



o] a 

Fia. 78. 


A for the factor B proceeds, increasingly larger additions of A are 
needed to compensate (i.e. to maintain product) for a given reduction 
of 5.* The constant product curves are then downward sloping and 
convex to the origin at all points of the positive quadrant of Oaby 
i.e. they form a system of curves similar in form to that of the 
“ normal ** indifference map described in 6.7 above. 

In a more general “ normal ’’ case, the properties as^med above 
apply only to certain ranges of employment of the factors while, out- 
side these ranges, product is maintained only by increasing the use 

* This is the principle of “ increasing marginal rate of substitution ” (see 
13.7 below). 
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of both factors. The form of the constant product curve system is 
then that shown in Fig. 78. In the area of Oob between the curves 
OA and OB, the curves are downward sloping and convex to the 
origin ; outside this area, the curves turn back and slope upwards. 
Here, when one factor is used in much greater proportion than the 
other, an increase in the factor’s use requires more of the other factor 
even to maintain product. 

The normal form of the vertical sections of the production surface 
can be traced from that of the constant product curves. If a line 
distant from the axis Oa is drawn in the plane Oab, the way in which 
it cuts the constant product curves shows the variation of product 



on the section perpendicular to Ob corresponding to the fixed amount 
bi of the factor B. In the simpler normal case, the line cuts higher 
and higher constant product curves from left to right, i.e. the vertical 
section rises as a increases. In the more general normal case of 
Fig. 78, the product increases as a increases at first, attains a maxi- 
mum value when a equals a certain value cq (where the line cuts the 
curve OA and touches a constant product curve) and then decreases 
as a increases further. The vertical sections of the production 
surface, for fixed amounts 6 ^, b^j 63 , ... of the factor B, are of the 
form shown in 'Fig. 79. Each section has a “ peak ” product corre- 
sponding to a combination of factors shown by a point on the curve 
OA of Fig. 78. The greater the fixed amount of B used, the more 
of A do we need before we obtain the peak product and the greater 
is the peak product. The curve OA of Fig. 78 is thus of great interest ; 
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it correspondfl to a “ ridge line ” of the production surface and indi- 
cates the peak product obtainable from any given use of the factor B. 
The vertical sections of the production surface for fixed use of the 
factor A are similar in form and lead to the definition of the curve 
OB of Fig. 78 as another “ ridge line ” of the surface.* 

A particular production function of the simpler normal type is 

where a is a positive fraction. f The constant product curve, for the 
given product a;=a;x, is given by 


i.e. 


5 = 




where .4'= 


and m =7 are positive constants. So 

1 - a 


d6__ mA' 
da ~ ^ 


and 


-f 1)4' 

da^ ^ ’ 


and the constant product curves are downward sloping and convex 
to the origin at aU points. In the particular case where a = | the 
curves are rectangular hyperbolas with Oa and Ob as asymptotes. 
Further, the vertical section for the fixed amount bi of the factor B is 

x=A"a<^, (A"==Abi^-^). 

Since a is a positive fraction, the section rises from left to right at a 
decreasing rate and never attains a “ peak " value. Notice that 

log X = log ^ + a log a + ( 1 ~ a) log b. 

On taking logarithmic scales for all variables, the production surface 
becomes a plane and the constant product curves straight lines. 

A production function of the more general normal type is 

x = 2jya6-^a*-£6*, 

where A, B and H are positive constants such that B^>AB, It can 
be shown that the constant product curves are of the form of Fig. 78 

♦The normal form of the vertical sections is described by Knight, Risk, 
Uncertainty and Profit (1921), pp. 100-1. 
t See, for example, Douglas, The Theory of Wagea (1934). 
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Math OA and OB as straight lines (see Examples XIII, 32), The 
vertical section for which 6=6^ has equation 

x = - Aa* + 2Hbia - Bbi*, 

( H*-AB^\ ,( II, \* 

.... (* = . 

i.e. the section is a parabola with axis vertical. The peak product 

H* - AB H 

^ is obtained when the amount ~ 6^ of the factor A is used 

A 

with the fixed amount b^ of the factor B. It is to be noticed that a 
zero (or negative) product is obtained if the amount of the factor A used 

is less than ^ (H-JH^-AB) or greater than ^ + - AB), 

For intermediate amounts of the factor, the product rises from zero 



to a peak and then falls to zero again. Fig. 80 shows certain constant 
product curves and vertical sections when the product of wheat in 
bushels is 

a; = 2(12a6- 5a* -462), 

when 100a men-hours of labour are employed on 6 acres of land. It 
appears, for example, that no wheat is obtainable from 10 acres when 
less than 400 men-hours are employed and that the wheat product 
rises to a fiiaxi'mum of 640 bushels when 1200 men-hours are em- 
ployed. 

11*9 The utility function and indifference curves. 

An individual consumer has a scale of preferences for two con- 
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Burners^ goods X and Y represented, according to the conditions of 
5.7 above, by the indifference map with equation 

(f) [x, y) = constant. 

Denote u = 

a continuous function of two continuous variables. The original 
indifference map is then the contour map of the surface representing 
this function. The variable u takes a constant value on any one 
indifference curve and increases as we move from lower to higher 
indifference curves. Hence, as the purchases of the individual 
change, the value of u increases, remains constant or decreases 
according as the change leaves the individual better off, indifferent 
or worse off, i.e. the value of u indicates the level of preference or the 
utility of the purchases x and y to the individual consumer. 

The concept of preference or utility, however, is an ordered but 
non-measurable one and the remarks of 11.5 above are of direct 
application. The function u = <f){x, y) is only one way of indicating 
the dependence of utility on the purchases. In general, if F{u) is 
any monotonic increasing function of u, then 

U = F{u)=F{<l>{x, y)} 

serves as an indicator of utility and can be termed the general index- 
function of utility,'^ It is quite immaterial which utility function, 
e.g. u = (f>{x, y), i^=log (f>(x, y), , i& used to represent the depend- 

ence of utility, as a non-measurable magnitude, on the individuars 
purchases. The surfaces representing the various functions, referred 
to axes of which Ou is vertical, are different but rise and fall together 
and have the same contour map, the indifference map of the indi- 
vidual as originally defined. 

The problem is generalised by assuming that the individual has 
a definite preference scale for the goods Xg, Xj, ... X„ appearing 
in his budget, a scale described by the indifference map 
(f>{Xi, X 2 , Xg, ... = constant. 

For a given value of the constant, this relation connects all purchases 
which are indifferent to the individual while, for incFeasifig values of 
the constant, we move from one set of indifferent purchases to 
another set at a higher level of preference. If there are three goods 

♦ Tliis is a translation of the term “ fonction-indice ** introduced by Pareto, 
Economic mathimatiquc (Enoyclop6die des sciences math^matiques), p. 596 
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X, Y and Z, the indiflerence map is represented by a system of 
surfaces in Oxyz space which, in their normal form, fit into each other 
as a series of shells, all downward sloping and convex to the origin. 
For more than three goods, no diagrammatic representation is 
possible, but the indifference map can be described, by analogy, as 
a system of hyper-surfaces in many-dimensional space. The indi- 
vidual’s utihty is again a non-measurable concept dependent on the 
purchases according to the general index-function of utility 
u =F{u) = F{<f>{Xi, Xj, a; 3 , ... x„)}, 

where F {u) is any function such that F' {u) > 0. The particular utility 
function u = (f> x^^ x^) is only one of many possible forms. 

A similar representation applies when we consider, not the pur- 
chases of the individual of different goods at one point of time, but 
the flow of his purchases or income over time. If x^, Xg, Xg, ... x„ 
represent incomes in successive years according to the conditions of 
6.8 above, we have a definite mdifference map 

0(^1) ••• ^n) = coi^stant, 

and we can write a corresponding index-function of utility 

U=F{u)=F{llj(Xiy X^y Xg, ... xj}, 

where F{u) is any monotonic increasing function. 

As an approximate representation of a normal indifference map 
for two goods or incomes, we can sometimes take the particular fonA 

u = {x-{-aY (y + 6)^, 

where a, 6, a and j3 are positive constants. Any monotonic increasing 
function of u, for example the function 

u' = logu~(x log(x + a) -f jS log(y + 6), 
serves equally well as an index of utility. The indifference map, 
common to all forms of this particular utility function, is shown in 
Fig. 25 in the case where a=^=l and a = 2, 6 = 1. It is seen to be 
of normal form. A rather different utility function 

u=ax-\-by + cJxy 

also corresponds to a normal case of an mdifference map for two 
goods or inj3omes. Any monotonic increasing function of u, such as 

serves equally well as an index of utility. The corresponding 
indifference map is shown in Fig. 26 in the case where a = 6 = 1 
and c = J2. 
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EXAMPLES XI 
Functions of two or more variables 

QCy 

1, Express the function 2 = - ^ implicit form and show that each 

variable is a single-valued function of the other two of the same form in eeuch 
case. Deduce that the sections of the surface representing the function by 
planes perpendicular to an axis are rectangular hyperbolas. In what sense is 
this a symmetrical function? 

2. Illustrate graphically the form of the contours, and of the other sections, 
of the surface z = ^/^. 

8. Show that the sections of the surface x + y + 2z — Z by planes perpen- 
dicular to an axis are parallel straight lines and deduce that the surface is a 
plane. Locate the points where the plane cuts the axes. Generalise to show 
that ar + 6 y-fC 2 ; + d = 0 is always the equation of a plane. 

4. Exeunine the sections of the surface x* + y* + z* = a* and show that it is 
a sphere of radius a. 

5. Show that «• + y • ~ z* = 0 is the equation of a circular cone with its axis 
along Oz. 

6 . Show that the contours of the surface j** + y* + 2 * - 2xz - 2yz = 0 form 
a system of circles and deduce that the surface is a cone. 

7. By a geometrical construction, show that ^ 

mid-point between two given points {x^, zj and (a:,, y,, 2 ,) referred to 
rectangular axes in space. Show that the mid-point between any two points 
ona; + y + 2 -|-l = 0 is also in the surface and deduce that this is the equation 
of a plane. 

8. If (a?!, yj, Zj) and (a;,, y,, z,) are two given points referred to rectangular 
axes in space, show that the distance between the points is 

(*i - *1)’ + (j/i - j/i)’ + - *i)‘- 

Deduce that the equation of a sphere with centre (a, 6 , c) and radius r is 
(a;-a)* + (y- 6 )* + (z-c)» = r*. 


9. Show that any point on the surface -H y * + z* - 2z = 0 is such that its 

distance from O is twice its distance from Oxy, 


10, By examining the sections of the surface by various planes, show that 
2 = a;* -H y * is a surface obtained by revolving the parabola z = x* about the axis 
Oz in the plane Oxz. 


11, Show that z=/(x* + y*) is a surfa<;e obtained by revolving the curve 
z = /(x*) about the axis Oz in the plane Oxz, Illustrate by considering the 
surfaces z = X* -f- y*, z = and z = log (x* + y*). 


12* If z = , show that z -► dt 00 as x -►a and y ->o, where a is any constant. 

x-y 


Dedu^ that the surfetce represented by the equation has “ infinities ” at all 
points above the line x - y = 0 in the plane Oxy, Check from the vertical 
sections of the surface. 
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13. Show that each section of the surface z = (x + y)e~^*^*^ by planes 
perpendicular to Oy haa a single maximum point and that the sections are 
the same curve translated varying distances parallel to Ox. 

14. Show that, for each of the functions 

Zzz:il x*y, z = •/x* - 2/* and z = — — , 

x-\-y 

the value of z changes from z to Az when the values of x and y are changed in 
proportion from a; to Aa; and from y to Ay. Deduce that the sirnface repre- 
senting any of these functions has a straight line section by a vertical plane 
through 0 €uid a given point on the surfeice. 

15. For any of the functions of the previous example, show that z = x<f>(r), 

where r = -, and that z==ytlt(8), where s = - . 

X y 

16. If <j>(x) and tft(y) are two functions of single variables, show that the 
sections of the surface z = <^ (a;) + 0(y) by planes perpendicular to Ox or Oy are 
of the same shape but variable height, and that similar sections of the surface 
z= <ji(x)\p(y) consist of a curve “ stretched ** or “ contracted ” by a variable 
amoimt in the direction Oz. 

17. In what sense can it be said that the function w = «• + y* + z* has con- 
tours consisting of a set of concentric spheres in Oxyz space T 

18. Show that a? + y + z=l, a?H-3y-z = 3 and 2a?-y + z=l have a imique 
solution mx,y and z. Illustrate by drawing the planes which represent these 
equations. 

19. Have the equations a;4-2y-z=l, 2a;-y + z = 3 and a: + 7y - 4z = 0 any 
solution? Verify that one equation can be deduced from the other two, 
i.e. that the equations are not independent. 

20. Eliminate z and u from the equations 

2xy 4-z--tt4-l = 0, a;*4-y*-2z-f-ti = 2 and 4x*y * + z*-u*+l = 0 

and obtain an equation in x and y only. 

Economic functions and surfaces 

21. If £z is the present value of £y available x years hence, interest being 
reckoned continuously at lOOr per cent., show that z — ye~^^. Show that the 
contours of the surface z — ye~^^t for a given value of r, form a system of 
discount curves. What are the vertical sections ? 

22. The demand for tea is a;, = 40 — and for coffee x, = 10 — thousand lbs. 

Vi ^ ^ P% ^ 

per week, where Py and p^ are the respective prices of tea and coffee m pence 
per lb. At what relative prices of tea and coffee are the demands equal? 
Draw a graph to show the shifts of the demand curve for tea when the price 
of coffee increases from 2s. to 2s. 6d. and to Ss. per lb. 

Show ttat Xy = ay^ and = — cPi are two simple examples of a demand 
law for a good Xy in competition with a good X, and that 

ar, = — — fuid a?- = ~c-Pi 
PiVt Pi 

are corresponding laws when Xy and X, are complementary. 
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24, The demands and of a consumer for two goods at market prices 

. a,-^hyX, a« + j 

p, and p, are related by the condition = -f where the a s and 

^ j 

the 6*s are constants. If /x = x^p^ + x^p^ is the consumer’s mcome, find the 
demands as functions of /x, p^ and p,. If the prices are fixed, show that each 
demand is a linear function of /x. 


26. It is known that x = is the product (in bushels) of wheat when 

100a men-hours of labour are employed on b acres of land. Find the constants 
A and a, given that 1500 bushels can be obtained from 100 acres when 10,000 
men-hours of labour are employed and that 2120 bushels can be obtained from 
the same area when 20,000 men-hours of labour are employed. Draw a graph 
of the variation of product as varying amoimts of labour are applied to 100 
acres. What is the product when 20,000 men-hours of labour are employed 
on 150 acres? 


26. The production function is x = Aa^b^f whore A, a and jS are constants. 
If the factors are increased in proportion, show that the product increases in 
greater or less proportion according as (a + P) is greater or loss than unity. 
How is this property shown on a vertical section of the production surface 
through O and a given point on the surface? What is the special property of 
the case a = 1 - /3? 

27. The following data are taken from Douglas, The Theory of Wages : 

Manufacturing Industries, U.S,A., 1900-20. 
a; = Day’s index of physical volume of manufactures (1899= 100). 
a = Average number of wage-earners in manufacture (1899= 100). 

«6 = Volume of fixed capital in manufacture (1899 = 100). 


Year 

X 

a 

b 

Year 

X 

a 

b 

Year 

X 

a 

6 

1900 

101 

105 

107 

1907 

151 

138 

176 

1914 

169 

149 

244 

01 

112 

110 

114 

08 

126 

121 

185 

15 

189 

154 

266 

02 

122 

118 

122 

09 

155 

140 

198 

16 

225 

182 

298 

03 

124 

123 

131 

10 

159 

144 

208 

17 

227 

196 

335 

04 

122 

116 

138 

11 

153 

145 

216 

18 

223 

200 

366 

05 

143 

125 

149 

12 

177 

152 

226 

19 

218 

193 

387 

06 

152 

133 

163 

13 

184 

154 

236 

20 

231 

193 

407 


Find the value of x given each year by the production function x = and 
express as a percentage of the actual value of x. Show that the deviation 
never exceeds 10 per cent, and find the average percentage deviation (neglect- 
ing signs). Plot a graph on a logarithmic scale of the variation of the actual 
and estimated values of x over the whole period. 


28. It» is given that a; = 40 bushels of wheat are obtained 

a + 6 

when 100a men-hours of labour are employed on b acres of land. If 10 acres 
are cultivated, draw a graph to show the variation of product as a varies. 
Show that the greatest product is obtained from this area when approximately 
1 050 men-hours of laboiu* are employed. 
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29. Consider vertical sections of the production surface 

_2Hab-Aa*-Bb* 

*“ Ca + Db 

and show that a proportional increase in the factors results in an equal pro- 
portional increase in product. 

30. Show that the production surfaces 

x=2Hab-Aa^-Bb* and x=:^2Hab - Aa^ - 

have constant product curves of identical form but that the variation of 
product from one curve to another is different. How does product increase 
in the two cases as the factors are varied in proportion from any combination ? 

31. If a and b are positive constants, show that the utility function 

u=(x - a)* + (y - b)* 

gives an indifference map of normal form for purchases x and y in the ranges 
the curves being arcs of concentric circles. 

32. Examine the utility function u = — , where a, b and c are positive 

c- vt/ + 6 

constants, and show that the indifference map is a set of parabolic arcs and 
of normal form for certain ranges of values of the purchases x and y. 



CHAPTER XII 


PARTIAL DERIVATIVES AND THEIR APPLICATIONS 


12.1 Partial derivatives of functions of two variables. 


In a function of two variables z=f(x, y), the two variables x and y 
can be varied in any way quite independently of each other. In 
particular, one of the variables can be allotted a fixed value and the 
other allowed to vary. The function, in such a case, reduces to a 
function of a single variable only. Two functions can be obtained 
in this way, 2 as a function of x only {y fixed) and z as a function 
of y only (x fixed). The derivative of each of these functions can be 
defined at any point and evaluated according to the familiar tech- 
nique. The derivatives so obtained are called the “ partial deriva- 
tives ** of the function z=f{Xy y), the term “ partial ’’ implying that 
they are defined only for very special variations of the independent 
variables. One partial derivative follows when x is varied and y kepi 
constant, the other when y is varied and x kept constant. 

*If the function z=f{x, y) is single-valued, its partial derivative 
with respect to x at the point (x, y) is the limiting value of the ratio 


^ as the arbitrary increment h tends to zero and the 
partial derivative is written either as — =z — f{x, y), or as z^.' =/.' (x, y). 


The notations are similar to those for ordinary derivatives. But, to 
indicate that we have the partial derivative with respect to x (i.e. that 
the other variable y is regarded as fixed), we use the symbol 9 ” 
instead of the previous “ d ” in the first notation and add a suffix x 
in the second notation. An exactly similar definition and notation 
can be given for the other partial derivative. Hence : 

Definition : The partial derivative of z=/(x, y) with respect to x 
at the point (x, y) is 
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and the partial derivative with respect to y at the same point is 

It is sometimes convenient to use slightly different notations. For 

example, and can be used instead of and — . Further, when 

ox dy dx dy 

no ambiguity is likely to arise, it is possible to use/j. and/^ as simple 

notations for the two partial derivatives. 

The meaning of partial derivatives is most clearly seen when they 

are interpreted in diagrammatic terms. In Fig. 81, P is the point on 



X 


Fio. 81. 

the surface z =f{x, y) defined by the values x and y of the independent 
variables. Two vertical sections of the surface can be drawn through 
P, one perpendicular to Oy and the other perpendicular to Ox, The 
former is the curve on the surface passing through P in the W.E. 
direction and showing the variation of z as a; varies. On this section, 
the value of y retains throughout the constant value allotted initially 
at P. The tangent PP* to the section at P has gradient measured 
by the deriv^tivfe of 2 as a function of x (y constant), i.e. by the 

dz 

partial derivative evaluated at (x, y). Hence, — is to be interpreted 

as the gradient of the W.E. section of the surface at P or, more 
shortly, as the W.E. gradient of the surface at P. In the same way, 
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dz 

the value of — at {x, y) measures the gradient of PTy, the tangent 

dy 

at P to the vertical section of the surface perpendicular to Ox^ i.e. the 
S.N. gradient of the surface at P. Hence : 

The partial derivatives ^ and ^ measure the gradients of the 

surface z=f{x, y) in two perpendicular directions at the point P 
defined by (x, y), the directions in the planes perpendicular to Oy 
and Ox respectively. 


It is to be noticed that we have not yet obtained a measure of the 
gradient of the surface in any other direction. 

The remarks (6.2 above) on the meaning of an ordinary derivative 
apply, with obvious modifications, to partial derivatives. One 

^z 

additional point, however, must be stressed. The value of ^ at a 

ox 

point (x, y) depends, not only on the value of x from which we start 
in the limiting variation defining the partial derivative, but also on 
the value of y which is kept constant during the limiting process. 
The partial derivative, as the W.E. gradient of the surface 2 =/(x, y) 
^ at the point P, varies in value when the point P is moved in any way, 
i.e. when either or both of the variables x and y are varied. Similar 

dz 

remarks apply to the partial derivative — . Hence, the value of each 

partial derivative at a point (x, y) is a function of both x and y. To 
stress this point, we can refer to the partial derived functions and use 
the particularly appropriate notations /a-' (x, y) and ff x, y). 

No essentially new idea is involved in the definition of the partial 
derivatives and they can be evaluated exactly as ordinary deriva- 
tives. The variable other tha;n the one directly concerned must, of 
course, be treated as a constant in the actual process of derivation. 
In particular, the rules for finding the partial derivatives of composite 
functions are similar to those for derivatives. Thef modified form of 
the function of a function rule, however, merits separate notice. It 
can be expressed : 

• 

If z is a single-valued function of u where u is a single-valued 
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funotion of x and y, then z is a function of x and y with the partial 
derivatives 

dz _ dz du dz ^ dz du 

dx du dx dy du dy 

In particular, if u is any single-valued function of x and y, we have 


d . du 

— u” = nu^-^ 


du 


and -logM = - 


1 du 


dx'" dx' dx '' dx dx^''^'" u dx 

and similar forms for the partial derivatives with respect to y. 

The practical technique of partial derivation is fuUy illustrated by 
the following examples : 

Ex. 1. ^(2z-3^ + l)=2 ; |^(2a; -3y + 1) = -3. 

Ex. 2. ^(** + 2a;?/-^*)=2(a;+2/) ; ^(x’‘ +2zt/ -i/^) =2(x -i/). 

Ex. 3. For any fixed values of the coefficients a, b, c, , 

d d 

g~(ax4-6t/ + c)=a; ~^-{az + by+c)=b ; 

{ax^ + 2hxy + by^ + 2gx + 2fy -he) =2{ax ~\-hy -^g) ; 
d 


3y 


(ax^ + 2hxy + by^ + 2gx + 2fy + c) = 2 (hx + by -f / ). 


2x(x -p + 1) -x^ _x(x - 2y + 2) 


(x-y + l)* 


(x-y + \f ' 


-f- 

dy \x 


.2 


(x-y + l)^ 


Ex. 6. ^ (a:® + y®)* = 4*(a:® + y®) ; ^ (x® + y*)® = 4y (a:* + y®). 

ox cy 


^ e®*+v* = 2a-e** + : .f- = 2ye*’+»*, 

dx* • dy 


d^ 

dx 


2x d 

log(a:* + y®)==-,;j^; ^log(x®+y®) = 


x®+y*‘ 


Ex. 6. Z- 'x®e») = 2xe'' ; {xH*) = x®e». 

dx' dy 
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Ex. 7. Ifz-=®*A/-^> then 
^ x-y 


Similarly 


log z =2 log * + i log(a: +y) - i log (x-y). 

„ 19z_2 11 11 2a:* -xy - 2y* 

zdx x^2x+y 2x-y x{x*-y*) * 

. dz_x(2x* — xy -2y*) 

dx J(x + y)(x-y)* 

Similarly ^ ■ i— . 

9y J{x + y){x-y)* 

Ex. 8. The definition of the 'partial elasticities of a function of two 
variables z = /(x, y) is similar to that of the elasticity of a function of one 
variable : 

Ez X dz_d(log z) - Ez ydz_d{\ogz) 

Ex^ zdx^' d(\ogx) Ey'^zdy 5(logy)* 

So, if z^xf^^t where a and p are constants, then 
log z = a log a; + jS log y 


Ez j 

and 


Ey 


12.2 Partial derivatives of the second and higher orders. 

, The two partial derivatives of a function z=f{x, y) are themselves 
fimctions of x and y. The partial derivation process can thus be 
repeated and we obtain two partial derivatives from each of the 
partial derivatives of z, i.e. four second-order partial derivatives of the 
original function z. By an extension of the notation, we can write 
the new second-order partial derivatives as 

dx^ dx \dxj * dy dx dy \dx) * dx dy dx \dy} dy^ ~~ dy \dyl ’ 

Alternatively, if /.'(a;, y) and y) denote the (first-order) partial 
derivatives, the second-order partial derivatives appear as 

y) > fyx{^9 y) I fxy(^i y) y) 

or, more shortly, as • ^ 

f XX 9 fyx 9 fxy and 

The order of the suflSxes indicates the order in which the partial 

d^z 

derivations are carried out. Thus, =/'i (x, y) =/,, is the 

oy ox 
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second-order partial derivative obtained by partial derivation first 
with respect to x and then with respect to y. 

It is to be noticed that the two ‘‘ cross partial derivatives 

and g^g^ are quite distinct in meaning and there is no a priori 

reason to assume that they have the same value at any point. A 
result can be established, however, stating that the two “ cross 
partial derivatives are identical in value, provided that certain con- 
ditions relating to the continuity of the function are satisfied.* 
This result, known as Young’s Theorem, will not be proved here but 
it is verified below in the cases of two particular functions. It is, in 
fact, safe to assume that, for any ordinary continuous function, the 
order of partial derivation is immaterial i 

dH _ dH 
dy dx dx dy 


The following examples illustrate : 


Ex. 1. 


From 


3 / 

dx\x - 




1)- 


x(x-2y + 2) 
(x-y + l)5' 


, we derive 


3* / x« \ 1 

3x*\x-y + l/ (x-y + 1)* 

X {(2x -2y + 2)(x -y + 1)* -x(x -2j/ +2)2{x -y+ 1)} 

3 * / X* \ 1 

3y 3x \x - y + 1/ (x - y + 1 )< 

x{(-2x)(x-y + l)»+x(x-2y + 2)2(x-y + l)}= 


From 



, we derive 

(x-y + l)*’ 


. . 2x(y-l) 

“ (x-y + l)»’ 

3 V x« \ , 3 f 1 •> 2x» 

3y*\x-y + l/ * 5yl(x-y + l)*/ (x-y + l)*’ 


♦ See Coiirant, Differential and Integral Calculus^ Vol. II (1936), pp. 66-7. 
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2* 


2y 


Ex. 2. Since ^log(x»+y») and ^log(a:* +y*) 


we have 


ay 


^jlog(a;*+y*) 
^^-log(a:»+y*)=2*|- 
log(a:*+«/*)=2?/|- 


(a:*+j/2)a 

2y 


_ 1 *_ 

dxdy 

a* 


ay’ 


jjlog(a:*+y*) = 


2 (**+«/*) -4** _2(x*-y*) 

}- 

)“ {x^+y^)*' 

2(x^ - y^) 


(a;2 +2/2)2 

^ 

(a;^ +1/2)2 

2 (a;2 + y^) - 4y2 

^-7y2\2 


(a:2+y2)2 » 

(a;2 + ?/2)2 ’ 

4xy 


(x^~y^y 


Ex. 3. The partial derivatives of a function <f) (z, y) are 0* 0^. It 

is given that the partial derivatives of the ratio of <^3. to considered as 
a function of x and y, are respectively negative and positive : 


dX \ (f>yj 


<0 and 


If'tA 

dy \ 


>0. 


It is required to express these inequalities in terms of the first and 
second -order partial derivatives of <f>{x, y).* We have 




^v^xx - 

^,2 

^X^VV ~ 

^,2 


The inequalities are, therefore, equivalent to the forms 

i^y^XX ~ 4*X^XV^^ and (f)g(f)yy — ^y<^a;y<C0. 

Each of the four second-order partial derivatives is a function of 
X and y and, by a further partial derivation process, we obtain eight 
palatial derivatives of the third order. We can then proceed to 
partial derivatives of the fourth and higher orders. The notations 
we have given clearly extend to partial derivatives of order higher 
than the second. Young’s Theorem can also be extended to show 
that the order of partial derivation is immaterial in a partial deriva- 
tive of any order whatever. Partial derivatives of the third and 
higher orders are, however, of little practical use ami wa shall seldom 
have Aeed of them in the present development. 


♦ The problem is taken from Pareto, Manuel d'iconomie politique (2nd Ed. 
192i), p. 676, where the mathematical steps given here are omitted. The 
function <l>{x y) is b. utility function of an individual for two goods X emd F. 
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12.3 The signs of partial derivatives. 

dz dz 

The definition of partial derivatives shows that and — must 

ox dy 

measure the rates of change of the function z=f{x, y) as x varies 
{y constant) and as y varies {x constant) from the point (x, y). 
Further, as we have seen, these partial derivatives measure the 
gradients of the surface z=f{Xy y) in two perpendicular directions. 
The meaning of the signs assumed by the partial derivatives is now 
clear. We have, for the partial derivative with respect to x : 

dz 

(1) If ~>0 at the point {a^ b)y then the function z=f{Xyy) 

OX 

increases as x increases from the value a (y remaining equal 
to the value 6) and the surface z —f {x, y) rises from W. to E. 
at the point where x=a and y = b. 

dz 

(2) If — <0 at the point (a, 6), then the function decreases and 

ox 

the surface falls in the direction named above. 
dz 

The sign of ~ at the point (a, 6) is to be interpreted in a similar way 

and refers to the variation of y from the value 6 (x remaining equal 
to the value a) and to the path on the surface through the point 
where x = a and y = b in the S.N. direction. The signs of the partial 
derivatives thus determine, at various points, the increasing or 
decreasing nature of the function, the rising or falling nature of the 
surface, in two particular and important directions. There remain, 
for later detailed consideration (14.1 below), the cases where one or 
both partial derivatives assume zero values. 

The values of the two ‘‘ direct second-order partial derivatives 
of z—f(Xy y) have also quite simple interpretations. The partial 

derivative measures the rate of change of ~ as a; increases from 
dx^ ^ dx 

the point in<»que3tion (y remaining constant), i.e. it measmes the 

“ acceleration of the function z=f(x,y) as x increases and y 

dH 

remains constant. In diagrammatic terms, — measures the rate 

ox # 

of change of the W.E. gradient of the surface z~j{Xy y) as we move 
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from the point concerned in the same W.E. direction, i.e. it deter- 
mines the curvature of the surface in the W.E. direction. So : 

dH 

(1) If ^>0 at the point (a, 6), then the function z=f{x,y) 

changes at an increasing rate as x increases from the value a 
(y remaining equal to b) and the surface z=f{x, y) is convex 
from below in the W.E. direction at the point (a, 6). 

dH 

(2) If — <0 at the point (a, 6), then the function changes at a 

OX 

decreasing rate and the surface is concave from below in the 
direction named above. 

d’^z . 
dy^ 

It is more diflScult to interpret the value of the “ cross second- 

dH 

order partial derivative. The value of , at any point measures 


The interpretation of the value and sign of ^ ^ exactly similar. 


dz 


dx dy 


both the rate of change of — as y increases (x remaining constant) 

OX 

dz 

and the rate of change of -- as a; increases {y remaining constant). 

dy 

In diagrammatic terms, the value indicates the way in which the 
W.E. gradient of the surface z=/(a;, y) changes as we move in the 
S.N. direction, and the way in which the S.N. gradient changes as 

d^z 

we move in the W.E. direction. So, if - — - is positive at a point P 

ox ay 

on the surface, the W.E. gradient increases as we move N. from P 
and the S.N. gradient of the surface increases as we move E. from P. 
It is difficult to visualise this state of affairs. For example, if the 
surface rises to the right and falls to the left as we look N. from P, 

dH 

then the positive sign of - implies that the surface tends to rise 

ox oy 

more rapidly to the right and to fall more steeply to the left as we 
actually move N. from P. The S.N. path on the surface through P 
moves on an increasingly precipitous incline to right aJnd left. The 

dH 

opposite result holds if — — is negative at P. These properties of 

tha surface are quite different from, and independent of, the separate 
curvatures of the surface in the W.E. and S.N. directions. 
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One important point must be stressed at this stage. The various 
partial derivatives of a function serve to indicate the rising and 
falling nature and the curvature of the surface representing the 
fxmction — but only in two particular directions, the “ fundamental 
directions in planes perpendicular to Ox and Oy. Nothing is said 
about the other directions on the surface, directions in which both x 

and y vary. For example, the fact that both — and — are positive 

dx^ dy^ 

at a point P on the surface z=f{Xy y) implies that the surface is 
convex from below in the W.E. direction and in the S.N. direction 
at P. It does not imply that the surface is also convex from below in 
any other direction (say the S.W. to N.E. direction) at P. In fact, 
it is quite possible that there is a “ dip in the surface between the 
two fundamental directions, so that it appears concave from below 
in the S.W. to N.E. direction. 


The results obtained above clearly extend to allow for the cases 
where any or all of the variables are measured on logarithmic, 
instead of on natural, scales. For example, 

d (log z) _^ldz 
dx ~~ zdx 

measures the rate of proportional change of z=f{Xy y) for actual 
changes m x {y constant) and is shown by the W.E. gradient of the 
surface obtained when z is taken on a logarithmic and x and y on 
natural scales. Again, the partial elasticity 

Ez _d (log z) _^x dz 
Ex d (log x) z dx 

measures the proportional change in z for proportional changes in x 
(y constant) and is shown by the gradient in the W.E. direction of 
the surface obtained when all variables are plotted on logarithmic 
scales. 


12.4 The tangent plane to a surface. 

The simplest twp-dimensional curve is a straight line, represented 
by a linear equation in two variables. The simplest surface in three 
dimensions is the plane and we can show that it is represented, as 
we expect, by a hnear equation in three variables. Geometrical 
considerations (similar to those of 3.1 above) establish the simple 
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result that, if two points in space have co-ordinates (Zi, y^, z^) and 
(®*. y*. 2j) referred to rectangular axes Oxyz, then the mid-point 

Suppose, 


between them has co-ordinates 


now, that (Xi, t/i, Zi) and (X 2 , y^y ^ 2 ) any two points on the surface 
(whatever its form may be) corresponding to the linear equation 

ax-^by -\-cz=d 
where a, 6, c and d are constants. Then 

aXi-\-byi-{-cZi=:d and ax 2 + by 2 -^cz 2 —d. 

Adding, and dividing through by 2, we find 



i.e. the mid-point of the two selected points also lies on the surface. 
This is true of any selected pair of points on the surface and it follows 



that the latter must be a plane. The general linear equation in three 
variables thus represents a plane. The actual location in space of 
a plane whose equation is given is easily determined by finding the 
points A, B and G where the plane cuts the axes (see Fig. 82). On 
the axis Oa;, we have ^=2 = 0. Substituting in the equation of the 
plane, ax=:d. So 



a 
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Similarly 


OB = -, and 


OC=-. 

c 


The cases where one (or more) of the constants a, 6 and c has a zero 
value clearly correspond to planes parallel to one of the axes. 

A plane is given as passing through the fixed point (x^, 

If the equation of the plane is 

ax i-by ■\-cz=d, 
then ax^ + bij^ -f czj = d. 

It follows that d can be eliminated so that the equation of the plane 
appears 

a(x-x^)+b{y-y^) 


or 




■p- 


C 


(c^O). 


Hence 


dz a . 
^ = - - and 
ox c 


dz b 
dy^ c 

are the gradients of the plane (referred to the horizontal plane Oxy) 
in the two fundamental directions perpendicular to Oy and to Ox 
respectively. The following result is thus obtained : 

The equation of the plane passing through the point (x^, Zj) 
with gradients a and jS in the fundamental directions is 
z-Zi = a{x-x^)+p(y-yi). 


In general, a plane through a given point P on the surface z =/(x, y) 
cuts the surface in a curved section on which the point P lies. If it 
is possible to vary the position of the plane so that the part of the 
section including P encloses a smaller and smaller area and finally 
tends to close down on P itself, then the limiting position of the plane 
can be described as the tangent plane to the surface at P* Suppose 
that the tangent plane exists at P and that a section of the surface 
and its tangent plane is taken by a plane through P perpendicular 

♦ The possibilities are rather more complicated than this brief statement 
indicates. It may happen, of course, that no such limiting plane exists at all 
and the surface may have no tangent plane at a point whore there is a “ sharp 
point ” or an “ edge of the surface. Further, it may happen that the section 
tends, not to a point, but to a straight line through P. The limiting pltine is 
then a tangent plane touching the surface at all points on the straight line 
through P. In any cose, it should be noticed that the section of the surface 
by the tangent plane can consist of a curve as well as the isolated point P. 
Only one part of the section need close down on the point P. ^ 
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to Oy, We obtain, from the surface, the curved path in the W.E. 
direction and, from the tangent plane, the tangent PT^ to the path 
at P (see Fig. 81). It follows that the tangent plane, containing the 
line -PTa, has gradient in the direction of the plane perpendicular to 

dz 

Oy given by the value of ^ at P. In the same way, the gradient of 

OX 

the tangent plane in the other fundamental direction is given by the 
dz 

value of — at P. If the point P has co-ordinates (x^y y^, z^), the 
dy 

fundamental gradients of the tangent plane at P can be denoted : 

So I 


The equation of the tangent plane at the point y^y z^) on the 
surface 2 =/ (a;, y) is 




This is an obvious extension of the form given for the tar gent line 

to a two-dimensional curve (6.5 above). 

As an example, the paraboloid surface z=x^-{-y^ gives 

dz ^ j 92 _ 

-~^2x and ^ = 2y. 
dx dy 

The tangent plane at ( 0 :^ 2 ^) has equation 

2 - = 2^1 {x - x^) + 2yi {y ~ y^), 

i.e. z = 2xx^ + 2yyi -f - 2 (x^^ + y^^). 

Since the point must lie on the surface, we have z^=x^-^ y^* and 

z-hZi = 2a;a:i-f2yyi 

is the equation of the tangent plane. 

d'* dz 

An interesting result can now be deduced. If ~ and — are both 

dx dy 

positive at a point P on the surface z=/(a:, y), ,theii the tangent 
plane* at P is positively inclined to the plane Oxy in the two funda- 
mental directions, and so in any direction in which x and y increase 

dz dz 

together. Hence, in the particular case in which — and ^ are both 
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positive, the function z=f{Xj y) increases whenever x and y both 
increase. The converse result holds in the particular case when both 
partial derivatives are negative. 


12.5 Partial derivatives of functions of more than two variables. 

The definition of partial derivatives extends readily to cases ot 
functions of more than two variables. If u=f(x, y, z) is a single- 
valued function of three variables, then a function of one variable 
is obtained whenever two of the variables are given fixed values. 
This can be done in three different ways and we have as a function 
of X only {y and z fixed), of y only {z and x fixed) and of z only 
{x and y fixed). Hence, we can define three partial derivatives, 
du du ^ du 
dz '' 

— =iAm /(^+^» z)-/(^. y. g) ^ 

dx h 


“ , — and — , of the function at any point (x, y, z), e.g. 


Then three second-order partial derivatives are obtained from each of 
the first-order partial derivatives, making a group of nine in all. For 
all ordinary continuous functions, however. Young's Theorem asserts 
that the order of partial derivation is immaterial and only three ,of 
the six “ cross " second-order partial derivatives are distinct. So we 


have only tliree “direct” partial derivatives ^ 

and three “cross” partial derivatives and 

\dx oy oy dz dz dxJ 

The alternative notations for partial derivatives also extend, in an 
obvious way, to this three -variable case. Finally, if necessary, we 
can obtain partial derivatives of higher order than the second by 
further derivation processes. 

In the general case of a single-valued function of n variables, 
y=f{Xiy a^a, ... there are n first-order partial derivatives 


Each corresponds to the variation of y as one of the independent 
variables changes, the other (n - 1) variables remaining constant. 
There are then second-order partial derivatives but this numboj is 
reduced in the case of ordinary continuous functions, since Young's 
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Theorem shows that the order of partial derivation is immaterial. 
There are, in fact, n “ direct ” partial derivatives 

d^y d^y V^y 

and Jw(n - 1) ‘‘ cross partial derivatives 

d’^y d^y d^y d^y 

dxidx2 * dxidx^ * *’* dx^dx^ ’ dx^dx^ ’ ’ * dx^dx^ ’ *’* * 

The analytical interpretation of the partial derivatives extends at 
once. The first-order partial derivatives, evaluated at a given point, 
measure the rates of change of the function as one of the variables 
increases from the given value, the other variables remaining 
dtt 

constant. Thus, if — is positive at the point (a, 6, c), then the 
ox 

function u=f{x, t/, z) increases as x increases from a, y and z having 
the fixed values b and c respectively. Further, the “ direct ** second- 
order partial derivatives measure the various “ accelerations of the 
function, one of the variables increasing from the given value and 
the others remaining constant. It is not possible, of course, to give 
any concrete diagrammatic interpretation of partial derivatives 
when there are more than two independent variables. 


12.6 Economic applications of partial derivatives. 

Under the conditions of 11.7 above, the market demand for any 
good X,. is a function of all market prices : 


Xf. — ^r(Pl9 Pty ••• Pr^ iX — ^ ••• 

The partial derivatives of this function indicate the variations of 
demand as one of the prices varies, other prices remaining constant. 

dx 

The partial derivative ^ , which must be negative in the normal 

case, shows the rate at which the demand for decreases as its 
price increases. It is usually convenient to put this rate into elas- 
ticity form — the partial elasticity of demand for X, with respect to 
its price I 

^ a(loga:r) ^ PrSXr 
3(logP,) Xrdpr‘ 


This expression, independent of demand or price units, shows the 
rate of proportional decrease of demand for proportional inoxeases 
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in price. It is an extension of the ordinary demand elasticity 
obtained when demand is regarded as dependent only on the price 
of the good concerned. The ordinary demand elasticity is defined 
only when the prices of other goods are fixed and known ; if these 
prices change, the whole demand law shifts and the elasticity must 
be evaluated anew. But our generahsed demand law and the 
demand elasticity obtained from it automatically allow for these 
shifts in demand. In fact, is a function, not only of the price 
but also of all other prices ; its value is thus seen to change when 
any of the prices change. 


' , measuring the rate of change in the demand for X, for increases 


doc 

The partial derivative measures the rate of change in the 

demand for X,. when the price of another good X, is increased, and it 
is best considered in relation to the corresponding partial derivative 
dx^ 

in the price of X,.. If both partial derivatives are positive, then the 
demand for X,. increases with the price of X, and the demand for X, 
increases with the price of The goods X^ and X, are then 
competitive, using this term in a broad sense. If both partial deriva- 
tives are negative, the demand for one good changes in the sense 
opposite to that of the price of the other good and the goods can be, 
called, in the same broad way, complementary goods.* It is often 
convenient, again, to use the elasticity forms 

^ 3(lQga;,.) ^ p,dXr^ ^ d(\ogx,) ^ Prdx, 

3(logp,) x,dp,’ d(logPr) x.dp/ 


i.e. the partial elasticities of demand for one good with respect to 
the price of another good. 

If the demand laws for two goods Xj and X 2 are linear : 


~ ^hlPl ^2^2 > 3^2 = 0^2 ^2lPl ~ ^22pit 

then the price elasticities of demand are 

Pi P 2 Pi j P^ 

’/ai— “®2i;;r V22~^22~ 

w^l • ^1 *^2 *^2 

and all depend on the prices assumed for both goods. 


* There remains the possibility that the partial derivatives are of opposite 
signs. In this case, the goods, on the present definition, are neither competitive 
nor complementary. See Schultz, Interrelationa of Demand, Journal of 
Political Economy, 1933, and 19.7 below. 
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On the other hand, if the demand laws are of the form 

then we have linear relations when all variables are taken on log- 
arithmic scales and the price elasticities of demand are constants : 

^11 ~ > V2I ~ ” ®21 ^22 ~^22* 

It is to be noticed that the two elasticities of demand for one good 
with respect to the price of the other are of the same sign, in each of 
the above cases, only if and agi are of the same sign. 

The production function x=f{ai, ag, aj, ... a„) shows the depend- 
ence of the output of a good X on the amounts of the variable 
factors Ai, A2, ... An used in the production. Suppose that a 



particular combination (Oj, ag, a^, ... a„) of the factors is used and 
that all factors except A^ are regarded as fixed in amount. The ratio 

3 / 

— (the amount of product per unit of the factor Ai) can be called the 

^ dx 

average product and the partial derivative — the marginal product 

oa^ 

of the factor A^ at the combination (o^, Ug, Ug, ... a„). The latter 
measures the rate of increase of output as the factor A^ is increased, 
the amoimts of other factors remaining unchanged. Both average 
and marginal products are functions of o^, a*, a,, ... an, i.e. depend 
on the grouping of the factors considered. Average und marginal 
products of the other factors can be defined in similar ways. 

With two variable factors A and B, average and marginal products 
oanr*be shown diagrammatically on the vertical sections of the pro- 
duction surface x=f{a, b). The section of the surface by the plane 
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6 shows the variation of product for a variable use of the factor A 
with a fixed amount of the factor JS. If P (a, b^) is a point on this 
section, then the average product of A at (a, b^) is shown by the 
gradient (referred to Oa) of the line OF and the marginal product of A 
by the gradient (to Oa) of the tangent FT. In the normal case of 
production (see 11.8 above), the section, which can be called the 
product ciu*ve for the factor A, takes the form shown in the first 
diagram of Fig. 83. A second diagram can now be constructed 
showing the variation of average and marginal products (see Fig. 83). 
Average product increases to a maximum at a definite point a=a 2 
where the tangent to the product curve passes through the origin. 
Average product equals marginal product at this position. Further, 
at some larger amount a^a^, product is a maximum and marginal 
product zero. All three curves depend on the fixed amount 6 i of the 
factor B used and change in position and form when this amount 
bi is changed. 

In the particular case where the production function is 

X = 2Hab - Aa^ - Bb^ (H^ > AB), 

then - = 2Hb-Aa-B- and ^ = 2lHb-Aa). 

a a da . 


A .nd 

da^ \a 


b^ 


Hence, for the fixed amount 6 ^ of the factor B, average product of A 

is a maximum when a = VI bu and both average and marginal 

products are then equal to 2{H - jAB)bi. In this case, the “ opti- 
mum ” use of the factor A and the maximised average product 
increase in proportion to the fixed amount of the factor B used. The 
marginal product curve for A, when b=b„ is a downward sloping 

straight line cutting Oa where a = — 6 ^, product here being a maxi- 

XX 

mum. The average and marginal product curves are of the normal 
form of Fig. ft3. • 

If w = ^ 2 , ^ 3 , ... a;„) is one form of an individuars utility 

function for consumers’ goods X 2 , X 3 , ... X,,, it might appear 

, , du du dll du . 

that the partial derivatives tt” > ^ ^ — represent The 

C/Xj (jXf^ C/X3 
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marginal utilities of the various goods to the individual at a 

duL 

set of purchases x^y x^y ... For example, — measures the 

rate of increase of utility when the purchase of Xj is increased, the 
purchases of other goods remaining unchanged. But we have seen 
that utility is a non-measurable magnitude to be indicated in 
general by 

u =F{u)=F{<f>{xi, Xj, Xj, ... x„)} 


where F{u) is any function such that F'(u)>0, Can any meaning 
be attached, therefore, to increments of utility and so to “ marginal 
utility ** ? We have 


dxi dxi ' 




dx, 


djl 

dXyy' 


’-F\u) 


du 

dXyy 


i.e. marginal utilities involve the arbitrary function F and lose their 
definiteness when utility is regarded as non-measurable. But 


dxx * dx^ ’ dx^ dXn dxi ’ 9^2 ’ dx^ 


and the ratios of the marginal utilities are definite concepts quite 
independent of the non-measurability of utihty. This fact will be 
developed in the following chapter (13.7 below). 

Suppose, for example, that one form of the utility function is 




where only two goods X and Y are considered by the individual. 
Then 

^ = oc{x-\-a)^-^{y + by and ^=iS(a;-f a)«(y + 6)^-^ 


i.e. 


du du 




dx dy x-{-a* y 
Another form of the same utility function is 

1 ^' = a log (a: + a) +i8 log {y + 6), 


whence 


du^ a 
dx x-\-a 


and 


du' p 


Tffe ratio of marginal utilities. 


dy y + 5 


x + a ' y + b 


, is thus perfectly definite. 
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12.7 Homogeneous functions. 

When the variables x and y are increased or decreased in a fixed 
proportion from given values, the corresponding increase or decrease 
in the function z=f(Xy y) may be in greater, in equal or in less pro- 
portion. In the very special case where z=:f{x, y) increases or de- 
creases always in the same proportion as x and y, the function is said 
to be homogeneous of the first degree, or to be linear and homogeneous, 
&uch a function has the property that its value is doubled whenever 
the values of the independent variables are simultaneously doubled 
from any position whatever. And a similar result holds for any 
ratio other than that of doubling. Hence : 

Definition : z—f{Xy y) is a linear homogeneous function if 
/(Aa;, Ay) =Af(Xj y) for any point (x, y) and for any value of A what- 
ever. 


To quote particular examples, each of the following function types 
is linear and homogeneous : 

( 1 ) z^ax^-by, ( 2 ) z=^axf^7/-^, 

-f 2hxy + 


(3) z = Jax^ -{-2hxy 


(4) 


cx^-dy 


where the coefficients, a, 6, c, ... , and the index a are constants. 
The fact that z changes proportionally for proportional changes in 
X and y can be verified in each case. 

If P is a given point on the surface representing a linear homo- 
geneous function, then any point with co-ordinates proportional to 
those of P also lies on the surface. The points (x, t/, z), {^x, ly, \z) 
and (2a;, 2y, 2z) all lie on the surface if one of them does. But all 
points of this nature lie on the straight line OP joining the origin to 
the given point P. In fact, the line joining 0 to any point on a linear 
homogeneous surface lies entirely in the surface. The surface is 
completely described by fines passing through the origin and is a 
particular case of what is called a ruled surface. It also follows that 
the tangent plane at P touches the surface, not only at P, but also 
at all points on the line OP, A cone with vertex at the origin is an 
easily visualised example of the type of surface corresponding 4o a 
linear homogeneous functioni 
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The sections of a linear homogeneous surface by planes perpen- 
dicular to an axis also reflect its special properties. If P is a point 
{x, y) on the contour by the plane 2=2i, then the point Q (2a:, 2y) 
must lie on the contour by the plane z = 2zi. But the points P and Q 
lie on a line through 0 so that 0Q=^20P. Hence, any radius 
through 0 in the plane Oxy cuts the 7^ and 22:1 contours in points of 
which the second is twice as far from 0 as the first. The contour 22:1 
is of exactly the same shape as the contour but radially double in 
size. A similar result holds for any pair of contours : 

All contours of the linear homogeneous surface are similar in shape 
and vary in size radially according to the ratios of the fixed values 
of z defining the contours. 


Any one contour is a radial “ projection of any other and the whole 
system of contours can be derived given any one contour. Further, 
the tangents to the various contours at points where they are cut by 
a given radius through 0 must aU be parallel. 

Exactly similar results hold of sections of the linear homogeneous 
surface by planes perpendicular to Ox or Oy, The various sections 
of such a system are radial projections of any one section of the 
system and vary in size according to the fixed values of a: or y 
defining the sections. Also, from the ruled surface property, the 
vertical section of the linear homogeneous surface by a plane through 
0 and a point P on the surface consists of a straight line. 

The linear homogeneous function is the simplest case of a wider 
class of homogeneous functions. More generally, if 

/(Ax, Ay)=A7{x, y) 

for any point {x, y) and for any value of A, then the function z =f{x, y) 

is homogeneous of the rth degree. For example, 2 = is 

homogeneous of zero degree, z=ax^ ■^2hxy is homogeneous of 
the second degree and z=ax^y^ is homogeneous of degree (a+j3). 
The diagrammatic properties of a general homogeneous function can 
be illustrated in the quadratic case (r = 2). The vefrtical section of a 
quadratic homogeneous surface by a plane through 0 and a given 
point on the surface is always a parabola with vertex at 0 and axis 
along Oz. The contoms of the surface are radial projections of each 
other and vary in size according to the square roots of the values of z 
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defining them. A similar property holds also of the v’ertical sections 
of the surface by planes perpendicular to Ox or Oy. 

The definition of homogeneous functions extends without diflB- 
culty to the case of functions of more than two variables. The 
function y=f(xi, 0 ^ 2 , ...) is homogeneous of the rth degree if 

f {^X-j^j A^2> •••) 

for any point {x^y x^y Xg, ...) and for any value of A. The c ase r = 1 
gives the linear homogeneous function, and h ere the proportional 
change in the values of all the independent v ana bles produces gm 
equal proportional change in the value of y. 


12.8 Euler’s Theorem and other properties of homogeneous functions. 

If z=f{Xy y) is a linear homogeneous function of two variables, 
then the following properties can be shown to hold at any point 
(x, y) whatever : 

(1) The function can be written in either of the forms 




y! 

where <f) and tfi are some functions of a single variable. 
dz dz 

(2) The partial derivatives -- and — are functions of the ratift of 


X to * 

(3) Euler's Theorem : 


By 


dz dz 


(4) The direct second-order partial derivatives are expressed in 
terms of the cross second -order partial derivative 
dH y dH d’^z^_^ x dH 

dx^ X dx dy dy^ y dx dy 

The proofs of these results can be set out formally : 

Since /(Ax, Ay) = A/(x, y) for any value of A, 


we have 


i.e. 


since 


/ ( 1 , 1 ) is a function of | only. Similarly, for A=- , 

*)=!'*©■ 
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Finding the partial derivative of z=x<f> with respect to x t 


dz 

dx 


= < l > (-1 


xj dx \x 




x ^ \x 


where <f>' denotes the derivative of the single variable function 

<!>[-] with respect to - • Further, 

\x/ X 






dz dz y 

Hence, both — and — appear as functions of the ratio - only. 

dx dy ^ ' 


Now, 


dz dz 
dx 






X/ ^ 


which is Euler’s Theorem! Finally^UTproVe The last result, we 
notice that Euler’s Theorem holds for any values of x and y whatever, 
dz dz 

Xi^ + y T" identically equal to z. | This identity is maintained 


no matter how x and y are varied and it follows that the partial 
derivatives of one expression are equal to the corresponding partial 
derivatives of the other.’*' Hence : 

d I dz dz\ dz 


i.e. 


dx 


dz dH dH dz 
dx^^ dx^^^ dxdy dx* 


^ , i.e. 


d^z 


y dH 


X dx dy 

A similar result is obtained by partial derivation with respect to y. 


♦ The equation of the derivatives of each side of an identity is a device of 
wide application (see, e.g., 14.8 below). Before applying the device, it is 
essential to establish that the relation used holds identically for all the 
variables with respect to which the derivation is carried out. A relation 
holding only for certain values of the variables cannot be treated in this way. 
Two examples make this clear. For all values of a?, we hav^e 

(1 +a;)*= 1 + 2a; + a;*. 

Hence “(1 +a;)* = ”(l + 2a? + a;*) = 2(1 -f a;). 

But ^ a;* + 2 = 3x * 

holds only for the particular values a? = 1 and a? = 2. Hence 

£(*■<- 

*£• 

as can be verified at a? = 1 or a? = 2. 
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The truth of Euler’s Theorem is also clear in diagrammatic terms. 
The tangent plane (12.4 above) at the point {x^, Zj) on the surface 
z—f(x, y) has equation 

*-*■=©.<’'-*■' + ( 1 ), 

It passes through the ori ^iu^ if x = y = z—0 satisfies the equation, 

Euler’s Theorem asserts that this condition is satisfied at all points 
ona line aFTiomogeneou s surface. The tangent plane thus passes 
thr ough ^ e origin ^ jail points, a iact which agrees with thejruled 
sur face property of linear homogeryous surfaces. 

The four properties generalise to the case of aTunction z =/(x, y) 
which is homogeneous of degree r : 


( 1 ) z = x^</> 


At)- 


are homogeneous of degree (r - 1). 


/„, dz ( 

( 3 ) ^^+y] 


(4) + + 


:r(r ~ l)z. 


The first of these extended results is established, exactly as before, 
by using the definition of a homogeneous function. The second and 
third results then follow by derivation. The fourth result is obtained 
from the third result, which is identically true for all values of x and 
y, by finding the separate partial derivatives : 


d I dz dz\ 
to V 


Similarly 


d^z d^z , , . dz 

d^z d^z . , , dz 


Multiplying by x and y respectively and adding, 

, 9*z „ dh ,d^z , dz ^ d: 






■■r{r - l)z. 4. 
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Finally, the properties hold, in modified and extended forms, for 
homogeneous functions of more than two variables. It is not 
necessary to set out the properties in the completely general case 
here ; the nature of the extensions is suflSciently obvious.* 

12.9 The linear homogeneous production function. 

The case where the production function of a good X with respect 
to variable factors Aj, A,, ... A„ is linear and homogeneous is 
one of particular interest. From the definition and results (1) and (2) 
of 12.8 above, the case is characterised by the fact that a propor- 
tional increase in c^/jfactor s leads to a proportional increase in 
produ ct and leaves the average an d marginal product_of each factor 
u nalter ed.^ WeJ ^ve, in fact, the case ^^^jj^ns tant m tuyns to s^^^ 
wh ere only the relative amq untaj&fjbhp> fafif-nra ngAH ia importa nt and 
^ not th^ctual scale of pr oduction. For example, if wheat is produced 
with land and labour under constant returns to scale, then the wheat 
product is (e.g.) doubled when twice the number of men are employed 
on twice the area of land. Furthe r, the product o f wheat pe r man 
or per acre depends only on ^e number of men employed per acre , 
^as does the marginal product of wli^t per man or acre . 

In the case of two factors A and B and constant returns to scale, 
the production surface is subject to restrictions additional to the 
normal ones already indicated (11.8 above). The surface is ruled by 
straight lines through the origin and any section through Ox consists 
of a straight line. The constant product curves in the plane Oab are 
now radial projections of each other and vary in size according to 
the constant products which define them. In particular, any radius 
through 0 cuts the curves in points where the tangents are parallel. 
So, in the normal case of Fig. 78, the curves OA and OB, marking 
the points with tangents parallel to an axis, reduce to straight lines. 
The sections of the production surface by planes perpendicular to 
(e.g.) Ob are also radial projections of each other and, in the normal 
case of Fig. 79, the maximum points all lie on a straight line through 
0, i.e. as the amount used of the factor B increases, the maximum 
product obtainable by varying A increases in proportion. Further, 
there is one line through 0 which touches all sections, i.e. the 


See Examples XII, 24-0, and 18.6 below. 
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maximised value of the average product of A is the same no matter 
what fixed amount of £ is used. 

Euler’s Theorem states that, for any combination of factors, 


x=a 


dx dx 


and this can be illustrated diagrammatically (Fig. 84). P is the point 
on the production surface for the combination (a, 6) of factors and 
the tangent PQ to the section perpendicular to Ob is drawn. Then 


dx RP RP 

^=gr»d.entofP(3 = ^ = g2, 


i.e. a^=BP. 
da 


So 


b~,=x-a^~^MP-RP = MR. 
do da 



The total product MP is divided, 3fR=b and RP=a — • At the 

dx 

point A on the section where total product is a maximum, 

and so ^ = At such a point, the average product of P is a 
db b 

maximum on the other section perpendicular to Oa, Similarly, at 

the point B on the section where the average product of .4 is a 

dx iff dx • * 

maximum, — =— and so ^ =0, i.e. the total product is a maximum 

' da a db 

on the other section perpendicular to Oa, There is a correspondence 
between the maximised total product for variation of one facto^and 
the maximised average product for variation of the other factor. 
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The following are simple examples of linear homogeneous pro- 
duction functions corresponding to constant returns to scale ; 

(1) ; (2) x = > {Z) x=^j2Hab - Aa^ - Bb^, 

{H^>AB in the two latter cases.) In each case, it is easily verified 

( X x\ 

~ and and the marginal products 


and ^ j are fimctions of - only, i.e. depend only on the ratio of 

the factors used. Case (1) corresponds to the simpler case of normal 
production and cases (2) and (3) to the full normal case illustrated 
by Figs. 78 and 79. 


EXAMPLES XII 
Partial derivatives 

1. Find the first and second-order partial derivatives of each of the 

1 X 

functions «* + y* - Zxy ; - - and log . Verify Young’s Theorem 

Vic* + y* x + y 

that the order of partial derivation is immaterial in each case. 

2. Evaluate the partial elasticities of and + 

a X u \i "1“ c 

3. Obtain the partial derivatives of • 

o,a? + 6,2/ + c, 

A By logarithmic derivation, evaluate the partial derivatives of 

(x-f y)(a; + 22/) 

(x-y)(x-2y) 

0 0 

and show that + {(x-i-j/)'‘e®^*'} = (x + !/ + n)(x-f 

ox oy 

6. Show that {(a; + = (x + y + n)e^* 

and that = + 

X^ 

6. Show that « = and its partial derivatives of all orders become 

x-y 

infinite at any point where x — y, 

7. If z = /(w) where w is a function of x and y, show that 

. dz Sz dz dz . dz ^ X 

+ = (c)*^ + y.- = Oifu = -. 

Verify in the case where /(w) = log u, 

8a If z = ^ (w) + 0 (v), where ^ (w) is a single-valued function of u = x + ay and 

a*z 0*z 

^(vyiB a single-valued function of v = af '-ay, show that * Verify 

when 4* M ^ (v) = e*. oy ^ 
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9. Evaluate the first and second -order partial derivatives of 
w = (x* 4- 2xy - y*)e* 

and verify that the order of partial derivation is immaterial. 

10 . Show that the surface z — n/^ (x and y positive) rises and is concave 
from below in each of the fundamental directions. 

11. If ^>0 and “<0 at a point (x, y), show that z increases whenever x 
increases and y decreases from the point (x, t/). 

12. Find the equation of the plane passing through the points with co- 
ordinates (2, 1, 0), (- 1, 0, 4), and (1, 2, 1) referred to rectangular axes. 
Where does the plane cut the axes? 

13. If A is a parameter, show that the equation x 4- y + Az = 1 represents a 
system of planes intersecting in a given line in the co-ordinate plane Oxy, 

14. Show that the tangent plane at the point (x^, zj on the surface 
z = ^/xy has equation ocy^ 4 - t/x^ - 2zz^ — 0 . 

15. Find the equation of the tangent plane at the point (2, - 1, 2) on the 

surface z — — — Where does it cut the axes? Deduce that z decreases 
X4-1/ 

whenever x and y increase from the values x = 2 and 3 /= ~ 1 . 


Homogeneous functions 

16. Graph the contours of the surface z—^xy given by z = 2 and z = 4 and 
show that the latter is radially double the size of the former. 

X* 4* V* 

17. Show that the contours of the surface z = form a system of - 

circles which are radial projections of each other. 

18. Show that the following are linear homogeneous functions : 


X* + 2/* 


Express each in the form z = x<^ Obtain ^ and ^ and so verify Euler’s 

Theorem in each case. x y 

19. Verify property (4) of 12.8 in the case of z — ^J^, 

20. Show that z = (UjX 4 - h^y)^ (a,x 4* and z = 4- o,x«t 2 /i-«i are 

linear and homogeneous and verify Euler’s Theorem in each case. 

21. For each of the quadratic homogeneous functions z = x* - xt/ 4- 2y * and 
X* 4“ v* 

2 = ^ show that the partial derivatives are linear and homogeneous and 

^ “ 2 / az , ez 
verify that x — + y — = 2 z. 
dx dy 

^/ 22 . For the homogeneous function z = ax®yP, show that 
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23. If « is a function of x and y homogeneous of degree zero, show that z 

can be written as a function of - only and that - - -x- • Verify by con- 

X dy y ox 

. x-y , x*-xy-\r2y^ 

sidermg z = log and z = ;r-r . 

® + y 2x^^xy + y^ 

24. If y=f(Xi, a:,, a?,, ... af„) is homogeneous of degree r, show that 

^ ^ ^ xojj ’ ’ a?!-' ’ 

where ^ is some ftmction of (n - 1) variables. Deduce that the partial deriva- 
tives are homogeneous of degree (r - 1) and that 

dy dy dy dy 

25. If t/ is a linear homogeneous fimction of x^, ar,, ... x^, show that 

dx^* ajj \ * dx^dx^ * dx^dx^ ^ dx^dx^J 

and similar expressions for the other direct partial derivatives. 

26. Show that the relation between the second-order partial derivatives of 
a function of n variables, homogeneous of degree r, is 

d*y 


®»‘ 1^1 + *•’ + — + 2*,*, „ - 
^ dx^* ■ 0a;,* * *0a;i0x, 


-l-... = r(r- 1 ) 2 /. 


Economic applications of partial derivatives and homogeneom functions 

27. If the demand laws for two goods are given by (4) of 11.7, show that 
the “ direct ** price elasticities of demand are independent of the prices 
while the “ cross ” price elasticities are determined in sign by the constants 

^ o„ and a,i respectively. 

28. The employment of 100a men-hours on 6 acres of land gives 
a; = 2(12a6 - 6a* - 46*) bushels of wheat. Graph the average and marginal 
product curves for labour when 10 acres are cultivated. Compare with the 
product curve of Fig. 80. 


29. Draw the average and marginal product curves for labour employed 

40 

on (a) 10 acres and (6) 100 acres when a; = ^-^(12a6 - 6a* - 46*) is the 

wheat product for 100a men-hours on 6 acres. Verify that the maximum 
average product is the same in the two cases, ten times as much labour being 
required in the second case as compared with the first. 

For the linear homogeneous production function x = ^ 

show that the average and marginal products of the factors depend only on 
the ratio of the factors and verify that the product is^ always a times the 
marginal product of A plus 6 times the marginal product of B. 


, 31. The production function * = ^/ 2Hah - Aa* ~ Bb* is linear and homo 
geneous ; show that the maximum value of the average product of .4 is a 

oonaUnt^^*"^® 


B 


independent of the fixed amoimt of B used. 
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32. The production function is x = Aa^b^ where a + 1. Show that there 
are decreasing returns to scale and deduce that the total product is greater 
than o times the marginal product of A plus h times the marginal product of B, 

33. If a men are employed in planting b acres with timber, the amount of 

timber cut after t years is x =f(a, b, t). What meanings can be attached to 
Sx bx bx 

— , — and — T If a? = where a and B are fixed positive fractions, 

oa 00 ot 

show that there are constant returns to land and labour after a fixed time t 
and that, for given employments of land and labour, the timber product 
increases, but in a decreasing proportion, as time goes on. 

^ Find the ratio of the marginal utilities for two goods when the utility 
function is w = + + c>/xy. Verify that the same result is obtained when 

the utility function is written u' = \og(ax + by 


35. If U = F{u) is the index-function of utility where u = axy, show that 
(a) ^-^ = 0, when F(u) = u; (b) 4— <0, when F(u)^»Ju; and (c) 

03C OjC ox * 

when F(u) — e^, 


38. If <f>(x, y, z, is one form of the utility function for goods 
Z, Y, Zt ... and if U — F\u) is the index-function of utility, show that 


0*17 

dx^ 


dhi 



and similarly for the other second-order partial derivatives. Deduce that the 
sign of each of these derivatives varies, in general, with the choice of the 
arbitrary function F. Illustrate with the results of the previous example. 



CHAPTER Xm 


DIFFERENTIALS AND DIFFERENTIATION 


13.1 The variation of a function of two variables. 

There is only one way in which the value of a function of a single 
variable can change, i.e. by changing the value of the independent 
variable. The variation of the function is then adequately described 
by means of the derivative. The rate of change of y=f{x) from any 
point X is measured by /'(a;) and Ay—f'{x)Ax is the approximate 
increment in y for an arbitrary small increment Ax in x. 

The expression of the variation of a function of two variables is 
much less simple. The value of z=f{x, y) changes when x changes 

dz 

{y remaining constant) at a rate measured by — . Hence, ii A gZ 

€ (/X 

denotes the increment in z due to an arbitrary small increment Ax 
• in the variable x from a point (x, y), then 

AgZ=^Ax approximately. 


In the same way, if A yZ denotes the increment in z due to an 
arbitrary small increment Ay in the variable y from the point (x, y), 
then 


AyZ = ~Ay approximately. 


But these are only two special ways in which the value of the function 
can change and there remains the important problem of expressing 
the variation of the function when the independent variables vary 
together in any way whatever. A single partial derivative is not 
sufficient here and an addition to our mathematical equipment is 
needed. 

It is assumed that the function z=f{x, y) possesses continuous 
partial derivatives at the point (x, y). If h and k denote arbitrary 
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increments in the variables x and y from the point (a;, t/), the corre- 
sponding increment in the value of the function is 

Az =f{x + h,y + Ic) -f(x, y) 

^ f{x+h, y + k) -f{x, y + k) ^ ^ f{x, y + k) -f{x, y) ^ 

h k 

But, from the definition of a partial derivative, 

f(x + h, y + k)-f(x, y + k) d i « 

y -h k) as a->-0. 


dx 


/(a;, y) 


as k-^Oy 


since the partial derivative is continuous. It follows that 

f{x-\-hy y-{-k)-f(x, y + k) ^ , a i. ^ 7 a 

{ — ^ — - = ^f{^yy)+€> where c ~>0 as a and 

n ox 

Again, 


i.e. 


f(Xy y + k)-f{Xy y) ^ d 
k dy 

f(Xy y + k) -f(Xy y)_d 


f(Xy y) as k-i^Oy 


= ~/(x, j/) + 77 , where 77 ->0 as A; -►O, 
dy 


Hence, writing the partial derivatives in an alternative notation; 

Az = + €^ h+ + 77^ ky where € and 77->0 as h and . 

The expressions € and rj must be small when both h and k are small. 
So, changing the notation for the increments in x and t/, we have : 

The increment in the function z =f(Xy y) corresponding to arbitrary 
small increments Ax and Ay in x and y is approximately 


This is a result of the first importance. The increment in z when 

dz 

x varies alone is represented approximately by 

dz 

increment when y varies alone approximately hy — Ay, It is now 

established that the increment in z when x and y vary together ex- 
pressed approximately by the sum of the two increments in the simple 
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directions. The two partial derivatives can be used, therefore, to 
describe the general variation of a function of two variables. But 
it is clearly convenient to have a precise symbolism for this general 
variation and a technique for operating upon the symbols. The 
concept of a “ differential ” and the process of ‘‘ differentiation ** are 
introduced to serve just this purpose. 

Our fundamental result can be written in a variant form am 
f{a + h,b-^k) =:f{a, b) + hfj{a, b) -f lcfy{a, b) 
approximately when h and h are small. Hence if a:=a and j/=6 are 
inserted in / (a;, y) instead of the correct values x = a + A and y = 6 -I- A;, 
then the error in the value of the function is approximately 

b)^-kfy(a, 6 ). 

Again, putting a; = a A, y = 6 + A:, we have 

y) =/(«, 6) + (a: - a) fj{a, 6) + (j/ - 6)/y'(a, h) 
approximately when (a; - a) and {y - 6) are small. This result 
enables us to replace /(a:, y) by an approximate linear expression in 
X and y for small ranges of values about the values a;=a, y=6 
(cf. 6.4 above). 


13.2 The differential of a function of two variables. 

The increment in the value of a function z=f{Xy y) for small 
Increments Ax and Ay in the independent variables can be reduced 

to two parts, a precise expression, ~ Ax Ay, which is of the same 

ox oy 

order of smallness as the increments Ax and Ay themselves, and an 
expression, cAx+rjAy, depending on values e and rj which are small 
when Ax amd Ay are small. The second expression is thus of the 
order of the squares of Ax and Ay, The precise and first-order part 
of Az is termed the differential of the function z—fix^y): 

Definition : The differential of a function z=f{Xy y) with con- 
tinuous partial derivatives at a point {x, y) is 

dz=^~ Ax-{- — Ayy • « 

. dx dy 

where Ax and Ay denote arbitrary increments in the independent 
variables from the point (x, y), 

w 

For convenience, we now denote the arbitrary increment dx by 
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dz and call it the difiFerential of the independent variable x. Simi- 
larly, Ay is written dy and called the difiFerential of y. The difiFer- 
ential of the dependent variable z is then defined in terms of the 
independent difiFerentials by the formula 

, dz dz . 
dz = —dx-\-~ dy, 
dx dy 

It must always be remembered, in using this formula, that dx and dy 
are no more than arbitrary increments in the independent variables. 
An alternative notation for the difiFerential of z ^f(x, y) is 

!/)=/,'{*, y)dx+f^'{x, y)dy^f^dx+fydy. 

The interpretation of the difiFerential of a function follows from 

dz dz 

the definition. The statement that + inaph^s that 

the increment in the value of z=f(x, y) is approximately 

. dz dz . 

Az = ^ Ax-h^ Ay 
dx dy 

when small increments Ax and Ay are allotted to x and y. The difiFer- 
ential dz replaces the increment Az in much the same way that the 
derivative replaces the average rate of change. So 2=/ (a:, y) tends 
to increase (or to decrease) whenever x and y are varied by small 
amounts so that the corresponding difiFerential dz is positive (or 
negative). 

dz ^ dz 
dy^ 

difiFerential of the function z=f(x, y) and from it can be derived two 
partial difiFerentials. When x is varied by an amount dx while y is 
held constant, the difiFerential becomes 

dz = ^ dx 
dx 

dz 

and the partial derivative — is thus the ratio of the difiFerential dz 

(y held constant) to the difiFerential dx. Similarly, the partial 

derivative — is the ratio of the difiFerential dz (x held constant) to 
dy 

the difiFerential dy. A partial derivative can be regarded as the ratio 
of two difiFerentials. 

It can be noticed that the definition of a difiFerential applies 


The expression dz = ~dx-\-^dy is often called the “complete^' 
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automatically to the case of a function of one variable. If y=f(x) 
has a continuous derivative /'(a;), then the differential of t/ is 

dy=f\x)dx 

for an arbitrary increment dxinx* This implies simply that 
Ay=f\x)Ax approximately 

when the arbitrary increment Ax is small. It follows that the 
derivative f\x) is the ratio of the differential dy to the differential dx, 
dy ' 

The notation — is now very useful since it can be regarded either as 

a derivative or as a ratio of differentials. The differential notation, 
however, adds nothing to the derivative notation in the case of a 
function of one variable. It is only when at least two independent 
variables are involved that the differential notation becomes essential 
in the description of variation. 

13.3 The technique of differentiation. 

The process of obtaining the differential of a function is called 
differentiation. The most obvious method of differentiating z=f(Xy y) 
is to evaluate the partial derivatives and to substitute in the funda- 
dz dz 

nqental formula dz — -^dX’\-~dy. But other methods are often 

more convenient. It can be shown quite easily that the rules obeyed 
by differentials are exactly similar to those obeyed by derivatives. 
If u and V are two functions of x and i/, then 

d{u ±v)—du ±dv ; d[uv)=^udv-\-vdu\ d " 2 ^^ • 

Further, if z=f{u) where w is a function of x and y, then 

dz=f'{u)du. 

du 

So d{u**)=nu^’^du; d{e^)=e^du; d{logu) = - 

As examples of the method of proof, we have 
d d 

d (uv) = ^ (uv) dx-^ — (uv) dy 
ax ay 


u 


[ dv du\ , [ dv du\ . 


^‘fhe derivative J'(x) is sometimes called the “differential coefficient”; it is 
the coefficient of the differential dx in the differential of the fimction y—f(x). 
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Again, if z=/(u), then 


dz 

dx 


=r(u) 


du 

dx 


and 



du 

dy 


and dz=~dx + ^dy=f’{u) + 

A good practical method of differentiation is to split up the given 
function into groupings of functions u, v, w, , each of which in- 
volves only one of the variables x and y. The differential of the 
function is expressed in terms of the differentials of u, v,w, by 
the above rules. The latter differentials are then given by ordinary 
single-variable derivatives, e.g. the standard forms 

dx 

d (x**) = nx^-^ dx ; d (e*) = e* rfx ; d (log x)= — , 

X 


and similar results in the variable y. Finally, it is to be noticed that 
du 

the result d (log w) = — provides a method of logarithmic diflferen- 
u 

tiation. For example, if w, v and w are functions of x and y and if 
uv 

z=— , then on taking the logarithm 

log z =log -f log V - log u? 

- dz du dv dw 

and — = — H . 

z u V w • 


The practical technique of differentiation is illustrated by the 
following examples. Each result can be checked by using the 
partial derivatives already obtained (12.2 above). 


Ex. 1. 

Here 


Ex. 2. 
Here 


2 = X* 4- 2xy - y*. 
dz=d (x^) + 2d (x7j) - d (y^) 

•^2xdx + 2{xdy -hy dx) - 2y dy 
^2{x -k-y)dx + 2{x -y) dy, 

z= r • 

x-y 

j (x -y -\-\)d(x^) -xH(x -y -\-\) 
• (x-y^\)^ 

(x -y -\-\)2xdx -x^(dx -dy) 
(x-y + 1)* 

x(x ~ 2y + 2) dx -f X* dy 
(x-y + D* 
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Ex. 3. *=log(z®+y*). 

So 2 =»logM where ««»x*+y*. 


, du d{x*+y^) xAx+ydy 
(tz= — 5 5— == Z S » — • 


Ex. 4. z=x*a/ — 

^ x-y 

So logz = 21ogx + ilog(x+y)-Jlog(x-y), 

, dz ^dx ld(x+y) Id(x-y) 

and — =2 — + ^ ^ ^ 

z X 2 x+y 2 x-y 

_ 4 (x* - y^) dx +x(x -y) (dx + dy) -x(x-{-y) {dx - dy) 

“ 2x(x^-^y^) 

(2x^ -xy - 2y^)dx +x^dy 

i.e. dz = , ^ {(2a:* -ocy- 2\/) dx + x^dy). 

■J(x + y){x-y)^^ 

13.4 Differentiation of functions of functions. 

dz dz 

The differential dz = ir dy iB dejfined for a function of two 

dx dy 

independent variables, dx and dy being then arbitrary increments. 
But exactly the same formula can be shown to give the differential 
of a function in which the variables x and y are not independent 
but dependent on some other set of variables. In this case, dx and dy 
are no longer arbitrary increments ; they are proper differentials 
depending on the arbitrary increments in the actual independent 
variables defining x and y. The proof of this extension of the 
differential formula is too involved to be given here.* It can be 
assumed to hold at least for ordinary continuous functions. Henco, 

The differential of the function z=f{x, y) is given by 

- dz j dz j 
dz=—dx-{-^dy 
dx dy 

whether the variables x and y are independent or not. 

* t 

The* extended formula is of importance in the evaluation of the 
differentials, and hence of the derivatives, of various types of 
functions of functions. The simplest case of a function of a function 

• See, for example, Phillips, A Course of Analysis (1930), pp. 229-31. 
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has already been considered. If z=f{u) where u is a, function of 
X and y, then we have seen that 


dz~f'{u) du 

We can now pass to the two most useful cases of more complicated 
functions of functions. 

In the first case, 2 is a given function of two variables x and y 
which are themselves given functions of a single independent 

dz 

variable t. Hence, z is a function of t and the derivative -j- is 


dz 

required. Write z=f{Xyy) with partial derivatives — and — 

dx • 

Further, write x = <j>{t) with derivative -^ = (fi\i) and yz=i(j(t) with 

■j dv 

derivative = 0 (0- Then 



, dz dz 

dz — — dx-\-Tr dy. 
dx dy 

where 

dx — <f)'(t)dt and dy = iff\t)dL 

So 



dz dz ^ dz .. . 

i.Oc 



The ratio of the differentials dz : dt is to be interpreted at once as the 
derivative of z as a function of t. With a small notational change, 
the derivative can be written 


dz __ dz dx dz dy 
dt dx dt ^ dy dt 

This result is a very simple one and, in practice, all we need do is 

dz dz 

to write the fundamental formula dz—~dx-¥^ dy, divide through 

dxdy 

by the differential dt and interpret the ratios of differentials as 
derivatives according to the definition of the problem. As a par- 
ticular case, iiz—J{x, y), where y is a given function of x, then 

dz dz dz dy 
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In the second case, 2 is a given function of two variables x and y 
which are themselves functions of two independent variables u and v. 
It is required to find the partial derivatives of z considered as a 
fimction of u and v. Write z = f(x, y) where x = <l){u, v) and y = ^{u, v), 
all functions having known partial derivatives. 


Then 


where 


dz—^dx-^^dy, 
dx dy 


So 


. dx ^ dx . 
dx — — du-\-i^dv 
du dv 


dz 
dy du 


__/dzdx 
\9x du 


and dy^^du-\~ dv. 
du dv 

3y\ j (dzdx dzdy\ , 


dz dx dz dy dz __ dz dx dz dy 

dxdu^ dy du dv dx dv ^ dy dv' 


The partial derivative of z with respect to is the ratio of the differ- 
entials dz and du when v is kept constant, i.e. it is the first bracket in 
the above expression for dz. The partial derivative of z with respect 
to v is obtained, in the same way, as the second bracket in the ex- 
pression for dz. Hence, 
dz 
du 

These results are again very simple. In practice, we divide the 

formula dz=^^dx-^^-dy through by either du or dv and interpret 
ox dy 

«the resulting differential ratios as partial derivatives. 

The basic form of the differential formula is thus capable of 
dealing with various types of functions of functions ; it is merely 
a matter of making the interpretation appropriate to the case under 
consideration. This is a striking illustration of the flexibility of the 
differential notation. Our results are still limited, however, to the 
case of single-valued functions and it remains to extend the appli- 
cation of differentials in the construction of a theory of multi-valued 
and implicit functions. 


13.5 Differentiation of implicit functions. 

Two variables x and y are related by a givei^ implicit relation 
/(^> y)*=0- In general, we now have y as a multi-valued function 
of X and x as a multi-valued function of y. The simplest method of 
dealing with such a relation is to introduce a third variable z which 
tak& its value iromf{x, y) for any values of x and y whatever. The 
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single- valued function z=f(x, y) has the given implicit function as 
its zero contour by z = 0, i.e. the given implicit function can be 
studied by relating those values of x and y which make z equal to 
zero in the explicit function z=f(x, y). 

As X and y vary in any way, independently or not, the variation 
of z^f(x, y) is given by the complete differential 

dz =fscdx+ fy dy 

where and fy are the partial derivatives of /(r, y). If {x, y) are 
values maldng 2 = 0 and if dx and dy are variations from these values 
keeping z = 0, then dz = 0 and 

Jxdx -\-fydy = 0 

is the relation between the differential increments satisfying the 
given implicit function / (r , y) = 0. Hence, 

and 

dx- /, dy- I 

Starting from a given point (r, y) satisfying the relation /(r, t/)=:0 
and remaining on one branch of y as a function of x, let Ay he the 
increment in y corresponding to a given increment Ax m x. Then, 
by the definition of differentials, the ratio dy : dx written above is 
the approximate and limiting value of the ratio Ay \ Ax when Ax 
is small. Hence, the ratio dy : dx can be interpreted as the derivative 
of y as a function of x given by /(x, y) = 0, taking only one branch of 
this function through (x, y). In tliis way, the notion of a derivative 
is extended to apply to multi-valued, as well as to siagle- valued, 
functions. The ratio dx : dy is interpreted, in the same way, as the 
derivative of x as a function of y given by the same implicit function. 
Hence, 

The implicit function /(x, y)=0 gives y as a function of x with 

derivative ~ at the point (x, y), and it gives x as a function 

dx f 

of y with derivative -- = - at the point (x, y). 
dy fm 

The two functions^are inverse to each other and it is seen that one 
derivative is the reciprocal of the other. 

One point must now be stressed most strongly. The derivatives 

^ and being expressed in terms of the partial derivativefitof 
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f(Xy y) = 0, are themselves functions of both x and y. The derivatives 
differ, therefore, from those of single-valued functions and the reason 
for this is not difficult to see. If y is a single- valued function of x, 
then it is sufficient to fix a value of a;, the corresponding value of y 

and of the derivative ^ being uniquely determined. The derivative 

of a single-valued function of x depends only on x. But, if j/ is a 
multi-valued function of a;, it is not sufficient to fix a value of x since 
this still leaves several equally possible values of y at choice. The 

dtj 

branch of the function, and hence the derivative ^ , can only be 

determined when the starting-point is specified as regards both x 
and y. The derivative of a multi-valued function is essentiaUy 
dependent on both x and y. 

These remarks can be illustrated by the simple relation 

a;* -f 10 

Here /(x, t/) = - le and /«. = 2a:, = 2y. 


So 


dy X 

^ = at the point (a;, y). 


The values (a:, y) must, of course, satisfy the given relation. In this 
case, y is a double-valued function of x and it is possible to separate 
the two branches : 

j/ = s/l6-a:* and y=~Vl6~a:2. 

On the positive branch, we have the single-valued function 

y= -f •yi6-a:^ 

which has derivative, at any value of a:, given by 
dy _ X 
dx 


n/16- 


x^ 


X 

y' 


On the negative branch, y = >/l6 -a:^ with derivative 

For a given value of a:, therefore, there are two 'different values of 
dx 


, one for each branch of the function. But both values are included 


in the general result 


dx 


- - ; the implicit form of the function and 
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of the derivative expression has thus great advantages even when 
the branches of the function can be separated. 

The derivative of an implicit function given by a relation /(x, y) = 0 
is evaluated at once from the partial derivatives of the expression 
f{x, y). In practice, however, it is often better to differentiate 
f(x, y) = 0 as it stands according to the method indicated in the 
following examples. 


Ex. 1. The relation a;* -2x + 4y + 1 = 0 gives y as a double- valued 

function of x. From 

f{x, y)=z^ + 7f -2x + 4y + l, 
we have = and /y = 2(y-i-2). 


dz y + 2 


Alternatively, differentiating the relation as it stands, 




i.e. 




i.e. 



x-l 

y+2 


as before. 


Ex. 2. The relation a;* + y^ - 3a:y = 0 gives y as a triple- valued function 
of z. From /(x, y) = + y^ - 3xy, we have 

/* = 3x2 _ _ 3^2 _ 3^^ 


So 


d7j 

dz 


y^-z 


Alternatively 

i.e. 


3x* + 3/g-3(y + x|)=0, 


which gives the same value of the derivative. 


The second derivative of an implicit function is defined and 
obtained from the first derivative. y) = 0 gives y as a function 

of X, then from ^ , we obtain 

ax Jy 


dx* dx \fy/ \3x \fj By \f j dz] 


M 
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l(fa 

dx \/, 


— /««/ » fxvfx ^ /v»/» ^ 


Substituting these expressions and the value of ^ , we have 

f fxxfll ~ fxvfx fxvfv ~fvvfx fx\ 


= - T-s (/**/»' - 2y*v/*y* +Avy/). 

Jy 

The second derivative thus involves the first and second-order 
partial derivatives of /(a;, y) and is a function of z and y. For 

d'^j z 

example, the relation a;^ + 2 /* = 16 gives — and 

S=-s(;)=4f 


1 / dy\ 1 ( z^\ 

~ \ ^ V ^ 2/ / 




\ ax/ y^v y J y^ y^ 

In diagrammatic terms, the implicit function /(a;, y) = 0 is repre- 
sented by a curve in the plane Ozy. In general, one branch of the 
curve passes through a point (a:, y) satisf 3 dng the relation and there 
is a definite tangent to the branch at the point. The tangent gradient, 
referred to Oz, is then given by the value of the derivative 

l=-f; 

Further, the curvature of the branch at the point is indicated by the 
d^y 

value of ^ at (a;, y). Our previous discussion of the applications of 

derivatives now extends at once to the case where we have a 
curve with an implicit and multi-valued equation. It must be 
remembered, however, that the results refer only to a single branch 
of the curve, the branch passing through a specified point {z, y). 

The equation of the tangent at a point (aj^, y^) on the curve 
f{Zy y) = 0 is readily obtainable. The tangent is the line passing 
through the point (a^j, y^ with gradient to Oz given by the value of 

^ at this point. The tangent equation is thus 


y^(«-»j)+y»,(y-yi)=o. 
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where and fy^ are the values of the partial derivatives of / (x, y) 
at the point y^). This is a generalised form of the tangent 
equation previously obtained (6.5 above). 

For example, in the case of the circle with equation 
x^ + y^-2x-i-4y-tl=0, 

we have f^ = 2{Xi-l) and /y, = 2 -f 2). 

The tangent at (x^, y^) on the circle has equation 

(Xi - 1) (x - Xi) + (2/, + 2) (y - 2/i) -0, 

i.e. - 1)^ + (2/i + 2) y -f 2y^. 

Since the point (x^, y^) lies on the circle, we have 

- 2^1 + 4yi + 1 =0, 

i.e. Xi2+yj2_^^_^2yi=:Xi-2yi- 1. 

The equation of the tangent is thus 

(Xi - l)x + (j/i + 2)y = Xi - 2yi - 1. 


13.6 The differential of a function of more than two variables. 

The definition of a differential extends at once to the case of a 
function of more than two variables. The variation of u=/(x, y, z) 
as the independent variables vary is given by the differential 


, dU y dU ^ dU y 

du = ~ dx -i- ^ ay — - dz. 
ox dy dz 

This implies that, if Jx, Ay and Az are arbitrary small increments 
in the independent variables, then the increment in u is approxi- 
mately 


. du . du . du . 

Au = - - Ax + ^ - Jy -h . Az, 
dx dy dz 

The complete differential is again the sum of the separate (approxi- 
mate) variations due to changes in x only, in y only and in z only. 
The differential formula again extends to the case where x, y and z 
are no longer independent variables and it can be used to give the 
derivatives of functions of functions of various types. 

The variation of variables x, y and z connected implicitly by some 
relation /(x, y, 2;)=0 must satisfy 


f^dx +fydy+f,dz = 0y 

where /a;, and /, are the partial derivatives of /(x, y, z). Rega^^g 
2 as a multi-valued function of x and y defined by tliis relation, we 
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obtain the partial derivative ^ as the value of the ratio dz : dx for 

y constant. Putting dy=0 in the above equation, we have 

f^dx+f,dz=0. 


i.e. 

Similarly 


— — — _•?* 

dx \dz/ y constant f» 

— — - h 

By ~ \dy} 

X constant ft 


In diagrammatic terms, a three-dimensional surface with equation 
in implicit form f(x, y, z) = 0 haa gradients (referred to Oxy) in the 
two fundamental directions given by the above ratios of partial 
derivatives. The tangent plane at j/i, z^) thus has equation 

z - z, = ( -^■) (a: - a;,) + ( (y - yj (see 12.5) 

i-e. /*. (x - x^) +/,, (y - y^) +/.. (z - Zi) = 0. 

More generally, for a function y=f(xi, ... x^) of n independent 
variables, we define 


Tne complete differential contains as many additive terms as there 
are independent variables. The interpretation and application of 
this formula are exactly similar to those already discussed in the 
simpler cases above. 


13.7 The substitution of factors in production. 

The production function x =f(a^ b) for two variable factors of 
production gives a system of constant product curves 

/(a, 6) = constant. 


In the normal case, one curve of the system passes through each 
point (a, 6) of the positive part of the plane Oab and is downward 
sloping and convex to the origin (at least over a certain area of 
the plane). Differentiating and denoting marginal products by 



and 



o 


/a^a ■f/5d6=0 


is jibe (approximate) relation between increments da and db in the 
factors along the constant product curve through (a, 6). This 
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relation holds for any point (a, 6) we care to select. Hence, the 
tangent gradient of the constant product ciuwe through (o, 6) is 

^=Jji 

da ft 

when referred to the axis Oa. In the normal case and over the 
relevant area of 0a6, this gradient is negative and its numerical 
value is termed the marginal rate of substitution of the factor B for 
the factor A in the production of the good X : 

da ft- 

The value of r depends on the combination of the factors considered, 
i.e. is a function of both a and 6. It represents the additional amount 
of the factor from the given combination of factors, necessary to 
maintain product unchanged when a small unit reduction is made 
in the use of the factor A, 

Since the constant product curves are convex to the origin in the 
normal case, the value of r must increase as b increases (and a 
decreases) along the constant product curve. The convexity con- 
dition is thus the expression of the principle of increasing marginal 
rate of substitution, of the assumption that it becomes increasingly 
more difficult to substitute B for A as the substitution proceeds. 
The interesting point now is to determine how fast r increases, i.e. to 
measure the ‘‘ elasticity ” of substitution. For any change from 

(a, 6) along the constant product curve, d 

crease (or decrease) in the use of £ as compared with that of A and 

dr = d the corresponding increase (or decrease) in the marginal 

rate of substitution. The ratio of these differentials, expressed in 
proportional terms to make them independent of units of measure- 
ment, is defined as the elasticity of substitution between the factors 
at the combination of factors considered. Hence, 

Definition : The elasticity of substitution between A and B is 



r 


where the differentials correspond to a variation along the constfifht 
product curve through (a, 6). 


-j represents the in- 
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In diagrammatic terms, or appears as the ratio of the relative 
increase in the gradient of OP to the relative increase in the tangent 
gradient of the constant product curve at P as the point P moves 
along a constant product curve in the plane Oab. 

The value of a can be written in terms of the partial derivatives 
of r or of the production function itself. We have 


d 


adb -b da 


and 


- dr j dr 
dr=^ da-h:iT do. 
da db 


But db= da = -r da, and so 

Jb 


d -] = 


ar + 6 


da and dr — - [r^ - da. 
ar^b 


Hence, 


ab dr 


dr 

da 


Evaluating ^ ^ ^ ^ ^ terms of the first and 

second-order partial derivatives of / (a, b), it is found that 

fafbWa+bfj,) 

abT 


where P = - (faaft^ - ^fabfafb +fbbfa^)- 

From this last result, it appears that, although a has been defined 
for the substitution of B for A, the same value is obtained when 
the substitution is of -4 for B. The elasticity of substitution is 
symmetrical with respect to the two factors. 

The curvature of the constant product curve at (a, b) is 

da * da \da) ~ da ^ \5a ^ db da) db da 

d^b 

Hence, a is a positive multiple of the reciprocal of i.e. it is 


positive and inversely proportional to the curvatime of the constant 
product curve. The larger is the value of a, the flatter is the constant 
product curve and the more slowly does the marginal rate of substi- 
tution increase as P is substituted for A. The magnitude of a is 
thus an indication of the ease with which product can be maintained 
b/^ubstituting P for There are two limiting cases. If A and P 
are perfect substitutes, so that product is maintained by increasing 
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B in proportion as A is decreased, then the constant product curve 

is a straight line, ^ is zero and a is infinite. If A and B are entirely 

incapable of substitution, being needed in a fixed proportion, then 
an increase in one of the factors from this proportion must leave 
product unchanged. The constant product cmve has a right angle 

at the point concerned, ^ is infinite and a is zero. As a increases 

from zero to infinity, substitution between the factors becomes 
increasingly easier. 

So, the elasticity of substitution a varies from one combination of 
factors to another, is independent of the units in which factors and 
product are measured, is a symmetrical relation between the factors, 
is positive for all normal combinations of factors and varies from zero 
to infinity according to the ease with which one factor can be sub- 
stituted for the other in production. 

When the production function is linear and homogeneous (constant 
returns to scale), a becomes of simpler form. Since 

/aa=-^/a(, and /m = - ^/a> (12.8 (4) abovd) 

we have T = (a/„ + 6/,)* 

and = (by Euler’s Theorem). 

{afa + bU)fal> 

Using the alternative notation for partial derivatives, 

dx dx 
da db 
^ ”” d'^x 

^ dadb 

When there are constant returns to scale, a is inversely proportional 
to the cross second-order partial derivative of the (linear and homo- 
geneous) production fimction.* In the particular case where the 

(X b 

production fu^iction is x=Aa^b^-^, it is easily seen that r=Y-— - 
and that (7= 1 at aU points. 

* It was in this form that a was first defined by Hides, The Theory of Wages 
(1932), pp. 117 and 245. See, also, Robinson, The Economics of Im^^xfeci 
Competition (1933), pp. 256 and 330, Lemer and Kahn, Notes on the Elasticity 
of Substitution. Review of Economic Studies. 19.‘13- 
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13.8 Substitution in other economic problems. 

Exactly similar definitions can be given for the marginal rate and 
elasticity of substitution for an individual consumer’s scale of 
preferences. A system of indifierence curves for two goods X and Y 
is defined by the equation 

if) y) = constant. 

The tangent gradient to the curve through the point {z, y) is 

dX <f)y * 

In the normal case, the curves are downward sloping and convex to 
the origin. The marginal rate of substitution of the good Y for the 
good X in consumption, defined as 

dz'^if),^ 

is thus positive and increases as the substitution of Y for X proceeds. 
It is to be noticed that R is the ratio of the “ marginal utilities ” of 
X and Y and independent of the form we assume for the individual’s 
utility function (see 12.8 above). But there is no need to consider 
“ marginal utility ” at aU ; the expression R is sufficient for the 
description of the indifference map. The elasticity of substitution 
between X and Y is then defined as in the production case, taking 
. values between zero and infinity according to the ease with which X 
and Y can be substituted in constimption to maintain a given level 
of indifference.* 

A similar analysis applies for a consumer’s preference scale for 
incomes in two years. If x and y are the amounts of this and next 
year’s incomes, we have an indifference curve system given by 

y) = constant. 

The marginal rate of time-preference is defined as the numerical 
gradient of the indifference curve through (r, y), reduced by unity f 



* See Hicks and Allen, A Reconsideration of the Theory of Value, Economics, 
1934. 


f The reduction of 



1 is made since we wish to define s zero time* 


preference as the case where an equal addition to next year*s income compen- 
sateSnor a given reduction of this year’s income, i.e. where ^ = 1. This 

is only a matter of the scale on which time-preference is measured. 
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The elasticity of substitution between the two incomes is then 
defined exactly as before. 

The same concepts are also of use in other problems, e.g. in 
problems involving the notion of a transformation function (see 
11.7 above). If two goods are produced with given resources so that 

F(x, y) = 0 

is the relation between the outputs x and y, then the expression 

) F 

^ gives the marginal rate of substitution of Y production 

for X production. Similarly, if x and y are this year’s and next 
year’s incomes obtainable from given resources, the expression 
written is the marginal rate of return over cost. The corresponding 
elasticities of substitution then follow as before except that a 
negative sign must be added, the transformation curves being 
concave, and not convex, to the origin in the plane Oxy, 


(Ja 

\ dx 


13.9 Further consideration of duopoly problems. 

In the duopoly problem with conjectural variations zero, as 
analysed in 8.8 above, the reaction curve Ci of the first duopoUst 
gives a?! as a function of x^ defined by 

f{x^, Xj) = 0 (x) + Xiiff' (x) - = 0. (where x = Xj + x,) 

CtXj 

The equilibrium outputs are then determined by the intersection of 
Cl and the similar curve (7, for the second duopohst. For stable 
equilibrium, we require that Ci be downward sloping with a gradient 
(to the axis Oxj) less than unity and similarly for Cg. It is now 
possible to investigate the implications of these conditions. 
Differentiating the above implicit relation. 




where 


0'(x)+Xij/r"(x) 

^ 2/7 ' 

dxi^ 


i+p' 




and numerical!; 

that p is positive. It is assumed, in the normal case, that the 


Hence, is negative and numerically less than unity pro'^ded 
dx^ 
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demand function p = 0(a;) is decreasing and that marginal cost 

is increasing. It follows that the numerator of p is positive, 
condition we require then reduces to 


dx^ 

The 


which is satisfied either if ^*\x) is negative or if though 

positive, is less that — { - These alternative conditions apply 

to the convexity of the demand curve and one or other is satisfied 
in many cases of demand laws, e.g. in the case of the linear demand 
law p=P-cxx. 

A rather more complex duopoly problem is the following. The 
first duopolist is assumed to act as before, expecting his rival’s 
output to be xmafiected by his own actions. The reaction curve 



dx 

is then obtained with gradient as given above with value lying 

(1X2 

between - 1 and 0 in the normal case. The second duopolist is 
assumed to forecast correctly the effect of his own actions on his 

dx! ‘ 

rival’s output, i.e, his conjectural variation is as obtained from 

(1X2 

The second reaction curve gives X 2 as a function of x^ defined by 

(where x=Xi+Xt) 
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dx 

Since < 0 and 0 < 1 + y- < 1 , the left-hand side of this equation 
is greater than the form {ip(x) -\-X2iff'(x)} in the case where his con- 

Hence, for any given Xi, must be 


jectural variation is zero. 


greater than before and, since this is assumed to be an increasing 
function of Xg, the value of X2 must be greater than before. The 
new reaction curve C2 is thus farther to the right than the old curve 
C2, 0 x 2 being horizontal as in Pig. 85 . If x^ and X2 are the old 
equilibrium outputs, and x^ and X2 the new, it follows that Xj^ <Xi 
and X2>X2- The second duopolist gains by being “alive’’ to his 
rival’s reactions. Further, since has a numerical gradient (to Oxg) 
less than imity, the sum of x^ and X2 is greater than the sum of 
Xi and X2, at least for a small shift in Cj* The joint output is greater, 
and the common price charged by the duopolists smaller, than in the 
previous case. 


EXAMPLES Xm 


Differentiation 

1, If h and k are increments in x and y from the point (x, y), find •The 
increment in z — x* + 2xy -y* in terms of a:, y, h and k. What is the difference 
between tliis increment and dzJ Show that the difference is of the second 
order in h and k, * 


2. Write down the equation of the tangent plane at (a, 6, c) on the surface 
z=f(Xy y) and show that the increment in the height of the plane when incre- 
ments dx and dy are given to x and y from (o, 6, c) is dz. Hence illustrate the 
approximation of dz to the increment in z. 


3. Evaluate the differentials of z = x* + f/* 

8 = log . 

x^y 


Zxy ; z = Vx* + y* ; z~e* 


4. Obtain the differential of z = (a; + y ) (a; - y) by logarithmic differentiation 
and check by differentiating z = a;* - y* directly. 

5. Take logarithms and differentiate z = (x + y)*dx - y and z = (a:-i' 


6. Find the dif^rential of u = (a;* + 2a:y - y*)e*. 

7. Show that /(a:, y) =/(0, 0)^-xf/(0, 0) 4 - y/y'(0, 0) approximately when 
X and y are small. Deduce that, for small values of x and y, 

- — ? — I - X'^-y approximately. 

1 -f a; - y 

Find the difference between these expressions in terms of x and y. 
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8. Show that = K® ” + 2) approximately when x and y are each 

nearly equal to 1. What error is involved in the use of this approximate value 
when a:= 1*2 and y = 0-9 ? 


9. The volume of a right-circular cone of base radius x and height y is 
i7rx*y. If a; is estimated as a with an error of h and y ash with an error of k, 
show that the error in the estimate of the volume is \iTa{2hh^ak) approxi- 
mately. 


10. For the cone of the previous example, show that the proportional 

Ax Ay 

inorecuse in volume is approximately 2 — + — when x and y are increased by 

Ax and Ay. Deduce that a 1 per cent, increase in base radius increases volume 
by the same percentage amount as a 2 per cent, increase in height. 


11. If pv = at is the relation between the pressure (p), volume (v) and tem- 
perature (0 of a gas, a being a constant, show that the proportional increase 
in volume is approximately equal to the difference between the (small) pro- 
portional increases in temperature and pressure. Deduce that a simultaneous 
1 per cent, increase in temperature and pressure leave the volume approxi- 
mately unaltered. 

dz 

12. From the expression for dz, find ^ when z = a;* + 1 /* where a; = 1 + < and 

y = 1 - 1. Express z as a function of t and check your result by direct 
derivation. 


13. Evaluate ^ if z = — - ■ , where a? = e* and y = er*, 

dt x-\-y ^ 

14. Find and if z = log (x - y), where a? = - and y = ~ 

du dv ON V u 

function of u and v and check by direct partial derivation. 


Express z as a 


16. If z = f(x, y), where x^a-\- at and y — b-\-pt, show that 


dz 

di 


dz dz 


and 


dH 


d*z 


dH 
dx dy 


+ iS* 


0yi- 


16. If * —S(x, y), where x = au-\-av and y = bu + pv, show that 


dz dz ^ dz 

du dx dy 


j dz ^ 


Generalise by finding the partial derivatives of/(x, y, z, ...) as a function of 
u and V if x^au + av, y = bu + pv, z = cu + yv, ... . 


17. Find ^ when z =f{x, y), where x and y are functions of t and show that 

d*z__ 0*z /dxV 0*z dxdy 0*z /dyV dzd*x dzd*y 
dt* dx* \dt/ ^ dxdy dt dt^ dy* \dx) ^ dx dt* ^ dy dt* * 


18. A function f(x, y) is homogeneous of the rth degree if f(u, v) = X^f{x, y) 
where t» = Aa; and v = Xy. Differentiate each side with respect to A, put A = 1 
and show that 


and 


x^’^y rf{x, y) (Euler’s Theorem) 
ox &y 

!)/(*.»)• 
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19. If *=/(*» y)» where x = Xxo and y = Ai/o vary in a fixed proportion, 

consider 2 as a function of A and show that ^ = l fx ^ . If € = - ^ 

dX X\ dx dy/ z dX 

is the elasticity of z=f(x, y) for proportional variation in x and t/, show that 
dz 02 

a; — + 1 / ^ = € 2 . If the function is homogeneous of degree r, show that € = r 
and so deduce Euler’s Theorem. 
dy d*t/ 

20. Find ^ and from the implicit function {x - a)* + (y - 6)* = c*. 

21. If aa;* + 2/ia:t/ + 62/*4-2^a;+2/y + c = 0, show that t ? and 

evaluate the second derivative. fix + by +/ 

dy 

22. Find ~ and when x* + y*- 3xy = 0. Show that y is a maximum 
when x = i/2. Interpret in terms of the graph of the function. 

23. If X* + y* + 2 * - 3ary = 0 defines 2 as a function of x and y, find ~ and 

0a; dy 

Deduce the second-order partial derivatives of 2 . 

24. Given f{x, y, z) = 0, find the first and second-order partial derivatives 
of 2 as a function of x and y. 

25. If a;* -f y* - 2ax 4-1 = 0 defines y as a function of x, a being a constant, 
show that 2a:y “ -f x* - y • - 1 = 0. 

26. Show that the tangent line at (Xj, y^) on the curve aa;*-f6y* = c is 

oxxj 4- 6yyi = c, and that the tangent plane at yi» on the surface 
ax* 4- by* 4- C 2 * = d is oxx^ 4- byy^ 4- czz^ = d. •• 

27. Explain the meaning of each of the six partial derivatives, of one 
variable in terms of another, obtainable from the implicit function /(x, y, z) = 0^. 
Express each in terms of the partial derivatives of / and deduce that 

dz dy dx_^ ^ 
dy dx dz 

28. Two relations <f>{x, y, z) = 0 and ^(x, y, z) = 0 define y and z as functions 

dy j dz . 

of X. Differentiate each relation and obtain expressions for — and -j- m terms 
of the partial derivatives of ^ and ijt. 

29. Differentiate the relations xy4-y3-2* = 0 and x* + y-z = 0 to give 

^ euid ~ . Solve the equations to obtain y and z explicitly in terms of x and 

cLx doc 

obtain the derivatives directly. 


Econo mic applications of differential 

30. If x=f{a, b) is a linear homogeneous production function, show that 
the marginal raie ol^substitution between the factors depends only on the ratio 
6 : a of the factors used. Verify in the particular cases 

, ^ _ 2Hab - Aa* - Bb* 

x=^*J2Hah - Aa* - Bb* and x= Ca-\-Db 

31. Show that the property of the previous example is also true^f the 
quadratic homogeneous production function and illustrate with the function 
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x=2Hab - Ao* - R6*. Is the property true for any homogeneous production 
function, e.g. for x = Aa“6^ ? 

32. If the production function is x — 2Hah - Aa* - show that at any 
point (a, 6) on a constant product curve 

db Hh^Aa d*b (H^-AB)x 

da~ Ha-Bh ^ da*~ (Ha - Bb)* ' 

Deduce that the first derivative is negative and the second positive provided 
H A 

that — a>6>— a. Deduce that the constant product curves are of normal 
B H 

form in the area between the lines Hb - Aa = 0 and Ha - Bb = 0, 

33. Show that, for the production function x=^^2Hab - Aa* - Bb\ 

Hx' 

(H‘-AB)ab ■ 

34. Show that the production function x = Aa°6^ gives a— I for all com- 
binations of factors. 

35. Show that the elasticity of productivity c (10.8 above) is the elasticity 
of the vertical section of the production surface through the axis Ox and the 
given point on the surface. Use the results of Example 19 above to show that, 
for any production function, 

dx _ dx 

a L 5 — = ^x, 

da db 

36. If tx = log(a;-f o)«(i/ + 6)^ is one form of a utility function, find the 
marginal rate of substitution between the goods X and Y and deduce that the 
elasticity of substitution is 

hocx 4- a^y 

<«. a = 1 “H ;rr . 

(oc-^P)xy 

37. If ox* 4- 6i/* = constant is the transformation function for two goods 
K and Y, show that the marginal rate of substitution of Y production for 

X production is and that the elasticity of substitution is always unity. 

38. Two goods X and Y are produced jointly with the aid of two factors 
A and B. Technical conditions define a relation /(x, y, a, 6) = 0, so that 
(e.g.) the production of Y is determined if the amounts of the factors used and 
the production of X are given. Show how a and b can be used as parameters 
to describe the shifting of a transformation curve between X and Y pro- 


duction. What do the partial derivatives 


dx* da 


~ represent? 


39. If, in the problem of the previous example, technical conditions give 
two production functions x~ <l> (a, b) and y—t)t(a, b), show that 

=_A_ 

Kdx/ V constant 

and interpret the meaning of this derivative. , ^ 

40. Of the duopolists of Examples VIII, 46, the first acts (as before) on the 
assumption that the second does not change output while the second correctly 
estimates the first’s output changes consequent upon his own changes. Show 
that the total equilibrium output is now approximately 44 sets per week. How 
is thi/Tbutput divided ? 



CHAPTER XIV 

PROBLEMS OP MAXIMUM AND MINIMUM VALUES 


14.1 Partial stationary values. 

In our discussion of the partial derivatives of a function of two 
variables (12.4 above), we postponed consideration of the cases 
where one or both of the derivatives assume zero values. Suppose 
dz 3z 

now that ^ — ^ ^ ^ function z=f{x, y) at a point (a, 6). 

Then z has a stationary value for variation in x from the value a, y 

d^z 

retaining the constant value 6. The sign of — at (a, b) determines 

^ ox^ 

whether z is a maximum or a minimum.* The vertical section of the 
suiface z=f{Xy y) which passes through P (where x=^a and 
the W.E. direction has a maximum or minimum point at P. 

3z 

In general, the condition — = 0 is an implicit relation between 

OX 

X and y represented by some curve in the plane Oxy. Points on the 
surface z=f{x, y) above this curve correspond to maximum or 
minimum values of z in the W.E. direction. Some of these points 

may correspond to maximum values (satisfying the condition 

dH \ ^ . 

— < 0 j and make up what can be called a ridge line of the surface 

in the W.E. direction. Others of the points may correspond to 

( d^z \ 

— > 0 j and define a trough line of the surface in 

the W.E. direction. The curve in Oxy giving the ridge and trough 
lines of the surface is clearly located on the contour map of the 


* Throughout the present chapter, the possibility that stationary values 
can include points of inflexion, though not overlooked, will not be taiion into 
account. 
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surface. The tangent gradient of the contour through the point 
(X, y) is 

Bz 

dy _ Bx 
dx~ Bz* 
dy 
dz 

which ifl zero whenever ^ = 0. The ridge and trough lines of the 

surface thus lie above the locus of points on the contour map where 
the tangents are parallel to the axis Ox. For example, in the case 
shown in Fig. 87 below, the line AB across the contour map corre- 
sponds to a ridge line of the surface in the W.E. direction. 

^z dz 

A similar analysis holds if ~ 9^0 and ~=0 at any point. The 

cx 


dy 


function then has a partial maximum or minimum value (according 

d^z\ ' 

to the sign of ^ j for variation of y alone. The ridge and trough 

lines of the corresponding surface are then obtained in the S.N. 
direction. They lie above the points on the contour map where the 
tangents to the contours are parallel to Oy. The line CD across the 
contour map of Fig. 87, for example, represents a ridge line of the 
surface in the S.N. direction. 

o 

The analysis extends without difficulty to functions of more than 
two variables. In general, if y=f{Xiy ajj, ... x„), there are n different 
kinds of partial stationary values, each corresponding to the vanish- 
ing of one of the n partial derivatives of the function. Further, 
though the analysis is expressed in terms of explicit functions, it 
applies equally well to functions written in implicit form. 


14.2 Maximum and minimum values of a function of two or more 
variables. 

If both the partial derivatives of the function z=f{x, y) are zero 
at the point (a, 6), then z has a stationary value for variation in 
X alone and for variation in y alone from (a, 6). J'urtfner, 


- dz . dz j _ 
dz=^ dx ^ ^ dy 
dx dy 

% 

for any values of dx and dy, i.e. z has a stationary value for any 
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variation whatever from the point (a, b). Such a point is said to 
define a stationary value of the function. Ignoring the possibility of 
inflexional points, there are three cases to distinguish : 

(1) The value of z is a maximum for any variation of x and y from 

(a, 6), the surface z=f{x, y) having a “peak** at the point 
where x—a and y = b. 

(2) The value of 2 is a minimum for any variation, the surface 

having a ‘‘ hollow ” with lowest point where x=a and y=b, 

(3) The value of 2 is a maximum for some variations and a mini- 

mum for other variations from (a, 6), the surface having a 
“saddle point** where x=a and y=b, i.e. a point where 
there is a “ pass ** over a “ ridge ** of the surface. 

The three cases are illustrated in Fig. 86. 




Stationary values, occurring where the tangent plane to the 
surface is horizontal (parallel to Oxy), are located on the contour 
map of the surface at points where the locus of points with tangents 


parallel to Ox = 0^ cuts the locus of points with tangents parallel 
to Oy ^^ = 0^ There is a stationary value wherever the rid^e or 
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trough line in the W.E. direction cuts the similar line in the S.N. 
direction. The lie of the contours around such a point determines 
the nature of the stationary value. If the contours tend to close 
down to a single point as their height increases or decreases, then the 
stationary value is a maximum value of type (1) or a minimum 
value of type (2). If a contour crosses itself at the point and there 
is no tendency to close down on the point, then the stationary value 
occurs at a saddle point of type (3). The cases are fuUy illustrated 
by the particular contour maps of Figs. 87 and 88. 

In more rigid analytical terms, the function z=f(x, y) has a 
maximum value at (a, 6) if 2 is a maximum for any variation of x 
and y from the values a and b respectively. A minimum value of z 
is defined similarly. Maximum and minimum values together make 
up the extreme values of the function. It is assumed that the function 
and its partial derivatives are finite and continuous.* A maximum 
(minimum) value can occur only if the function is a maximum 
(minimum) in each of the two fundamental directions in which x 
and y vary alone, i.e. only if the two partial derivatives are separ- 
ately zero and the function has a stationary value. To distinguish 
maximum from minimum values, or either from other stationary 


values, is a matter of some difficulty. 


If — and — are both negative 

kt a stationary point, then z must be a maximum in each of the 
fundamental directions and it is likely that 2 is a maximum in all 

d^z d^z 

directions. Similarly, if — and ~ are both positive at such a point, 

then 2 is likely to have a minimum value for all directions of varia- 
tion. But these results are by no means certain. We must exclude 
the possibility that z behaves differently in some directions as com- 
pared with the fundamental directions. It is shown below (14.7, 
Ex. 3 and 19.1) that 


dx^ dy^ 


>IJ^y 


\dx dy) 

• 

is a condition sufficient to exclude this possibility. Hence i 


• Afl in the one-variable case, a maximum or minimum value of z =f{x, y) 
can oacur at a point where the partial derivatives are not continuous. Such 
a point is not shown by the criterion below. 
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CRITERION FOR MAXIMUM AND MINIMUM VALUES 

(1) All maximum and minimum values of z =f(x, y) occur where 

dz = 0 for aU variations of x and y, i.e. where 

_ 32 _ ^ 
dx dy 

(2) If g and g are negative and such that g g > (^)’ at 

a point x = a, y — b, then /(a, 6) is a maximum value of the 

function. If — ^ ^ ^re positive subject to the same 

inequality at a point x = a, t/ = 6, then /(a, 6) is a minimum 
value of the function. 

The first is a necessary condition. The second condition when added 
to the first is a sufficient condition and defines situations in which 
only a maximum (or only a minimum) value can occur. But the 
condition is not complete ; it is not necessary and sufficient. It 

is possible to show that, whenever the 

dx^dy^ \dxdy/ 

stationary point concerned is a saddle point. But the case where 

d^z d^z ( dH 

— — = y——j is still open and the stationary value concerned 

may be a maximum or minimum value or it may not. 

The criterion holds, in exactly the same form, for functions given 
implicitly. Iff{x, j/, z) = 0 defines z as a function of x and y, we have 
only to write 

dx f.^dy 'f. 

and the criterion applies. The necessary condition for a maximum 
or minimum value of z reduces to /»=/» = 0. 

The criterion of maximum and minimum values can be extended 
to functions of more than two variables, but only as regards the 
necessary condition at this stage. All the maximum and minimum 
values of y ••• ^n) occur at points where dy = 0 for all 

variations in the independent variables, i.e. where 
dy dy _ _dy 




dxi dx. 


dx„ 


Similarly, {{/(x^, X 2 , ... y)=0 defines y as a function of the other 
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variables, then the maximum and minimum values of y occur where 

/«, —fxt = • • • =/«n ~ 

Sufficient conditions are given later (19.1). 

An important feature of the results here given is that, since the 
differential of a maximised or minimised variable is zero at the point 
concerned, we can treat such a variable as a constant in the process 
of differentiation. Suppose that, in the relation 

/(^i, ... a:„, y, z) = 0, 

the variable y is to be maximised for variation in x^y ... a;„, the 
variable z remaining fixed. Differentiating and putting dy = 0 (since 
y is maximised) and = 0 (since z is fixed), we have 

/*. +/*. da:, + . . . dx^ = 0 

for any variations in the variables a:,, ... a:„. So 

/os, =/ *, = ••• —fxn ” 

Exactly the same conditions obtain if z is maximised while y is 
held constant. There is no distinction, in differentiating, between 
variables which are maximised (or minimised) and variables which 
are held constant. 


,14.3 Examples of maximum and minimum values. 

Maximum and minimum values of a function z=f{x, y) are 

dz dz 

located in practice as follows. The equations ~ = — = 0 are solved 

for X and y and each solution examined in turn. In the majority of 
cases, the sufficient conditions (involving the second-order partial 
derivatives) enable us to distinguish maximum and minimum 
values. In some cases where the conditions fail, we can examine the 
contour map of the function in the neighbourhood of the point 
concerned. And, even if the sufficient conditions apply, the contour 
map provides a useful illustration of the situation. The following 
examples illustrate the method : 


Ex. 1. z = 

dz 

Here = and 
ax 



There is one stationary value (z=0) occurring where x^y^^O and 
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v^alue must be a minimum since all other values of z are positive. The 
sufficient conditions are found to support this, since 

dx^ dy^ dx dy 

The contour map of the function has trough lines along Ox and Oy and 
is shown in Fig. 76 above. 

Ex. 2. z^2x-\-y -vxy ~y^. 

dz dz 

Here ^ = ~2x + t/ + 2 and ^ = x - 22 / + 1 

and stationary values of z occur where 

2x-y-2=0 and a:-2y + l=0. 

There is a single stationary value (2=|), where a;=f and j/=t. 

S-., 5^^=. 5-., 

the sufficient conditions show that z = | is a maximum value. The contour 


o 

m2- 



• • Fia. 87. 

laap of the function is shown in Fig. 87, rising contours closing down 
on the point where x , j/ =i* The line AB {2x — y-"2=0)isa ridge line 
in the W.E. direction and the line CD (a; -2y + 1 =0) is a ridge fine in 
the S.N. direction. 
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Ex. 3. 
Here 


z = -xy 


dz 

dx 


= 8a;-y-3a:*; ^=-a: + 2y; 


d^z 


33 - 2 - 2 3x); 

Stationary values of z are given by 

3a:*“8a;+y=0 and x^2y. 

On solving (e.g. by eliminating y), we find two stationary values, 2=0 
where a; = y = 0, and where a;=f, y=f. The first is a minimum 

value of 2 since, when x = 2 / we have 

3^2 2>0 and 


dy 

d^z 


= -1 and 


ay*” 




= 16>(- 


9^2 
9x 9y 


a. 


But when x = f and y = 5 the two direct partial derivatives of the second 



order are opposite in sign ( - 7 and 2) and the stationary value here occurs 
at a saddle point. Fig. 88 shows certain curves of the contour map of the 
function. The curve ABC (the parabola y = 8x - 3x^) represents the 
ridge and trough lines of the surface in the W.E. direction. The part 
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f 4 d^z \ 

^ = 0 and ^2^^) trough line and the part 

/ ^ d^z \ 

, g ^==0 and ^< 0 J is the ridge line. The line DE (a? = 2 y) is 

( dz d^z \ 

~=0 and ^> 0 j. The lie of 

the contours about the two points of intersection of the line and the 
parabola show that there is a minimum value of z at the origin and a 
saddle point at a: =f, y =5. 

Ex. 4 . + 3z^ ~2x + 2z =0, 

Taking 2 as a function of x and y, we find 

_ a; - 1 rt ^ 

The only stationary values of 2, for variation of x and y, occur at a; =* 1 , 
y =0. On substituting in the original equation, two values of 2 are found, 
2 xa - 1 and 2 = J. Further, we find 

when a; = 1 , y *= 0 . Similarly, at the same point, 

1 

dx dy dy^ 82 + 1 * 

So, when 2 = ~ 1, the direct second-order partial derivatives are positive 
and the cross derivative is zero. By the sufficient conditions, z— - 1 is a 
minimum value of 2. In the same way, 2 == J is found to be a maximum 
value of 2. The solution can be illustrated by a contour map of this 
double- vr lued function (see Examples XIV, 9 ). 

14.4 Monopoly and joint production. 

A monopolist produces amounts and X2 of two goods and 
at a total cost of n=F{Xi, x^. The two goods are related in con- 
sumption ancj the demands of the market are 

= where and jpa are the prices charged. The net 

revenue of the monopolist is 

Making use of the demand relations, this is a function of the two 
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prices. It is taken that the monopolist fixes prices to maximise net 
revenue. The necessary condition for this is 

dpi dp, ’ 

. / dn\ dx, [ dIT\ dx, 

^ dxj dpx ^ V * dxj dpi 


( dn\ dx, ( 
V ‘ dxj dp, 


377\ dx^ 


dxJdp^'K^^ dxjdp^ 

There are two relations to determine the prices. Subject to the 
prices obtained satisfying the conditions for a maximum (rather than 
a minimum) value of j/, the problem is solved. 

To illustrate the solution, a simple case can be considered in 
which the cost and demand functions are linear : 

“t* 0C2^2 > 3^2 =^2 ^12Pl ^22P2* 

The average costs of producing the goods can be taken as constants, 
oj and cxj respectively. Denote 

^10 ”” ““ ®12^2 Q'Ud •^'20 ~^2 *” ®22®^2> 

i.e. the demands when the “ competitive ” prices (equal to average 
costs) are charged. Then 

“^ 2 ( 7 ^ 2 "" ^2) 

and ^ 2 ~^ 2 o “ ®i 2 (jPi ”■ ^ 1 ) "■ ^ 22 ( 1^2 * 2 ) 

and the monopolist’s net revenue is 

y =^iPi *b ^2P2 ~ ^=3*1 (;?i - ai) + ^2 (P2 - «2)- 

dpi dpi dpi 

=^10 “ 2 a^i(Pi “ «i) “ ^^uiPz “ *2) 

By 

and = x^ — 2ai2 (Pi ~~ ^i) ■" 2a22 (Pi ■" * 2 )* 

Putting these expressions equal to zero for maximum net revenue and 
solving, we find the monopoly prices 

, ^22^10 ~ ®i2^2o _ , ^11^26 ~ ^.2^10 


Fortker, ^,= -2au; - 

^ dpi^ dpidp. 


- 2aj, and 


9*!/ 

5 —-= - 2 o,,. 

dp,* 


If Ou and a,, are positive and such that ai»*<a„a„, then the prices 
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give a maximum value of net revenue. These limitations on the 
values of the constants are assumed below. 

Suppose, now, that two independent monopolists produce and 
X2 with costs ni = (XiXi and ©ach assuming that the other 

will not change his price and maximising net revenue. The net 
revenue of the first monopolist is 


2/1 — ^iVi (Pi ~ “1) 

where the demand gives as before. For a given value of the 

maximum net revenue is obtained when ^ = 0 , i.e. when 

dpi 


^10 ”■ {Pi — cci) - <h2(Pi “ <^2) — 
Similarly, for the second monopolist. 


^20 “ ^12 (Pi “■ ®i) ~ 2^22 (1^2 “ ^2) — 


Solving these two equations, the prices charged by the independent 
monopolists are found to be 


Pi = «!■+' 


^22^10 1^2^20 

2 (< 3^1022 


and J?2 = «a + 


^^20 ~ 1 ^ 2^10 
2 (aiiU22 ” J®12*) 


d^ij 

Since -t^ = - 2 au <0 and similarly for -r^> the prices coirespoffd 
CtP\ ^P% 

to positions of maximum net revenue. 

If 0^2 is negative, so that the goods are (in the broad sense) com- 
petitive in consumption, it is seen that the joint monopoly prices are 
greater than the separate monopoly prices ; the effect of joint mono- 
polistic production is to restrict output and raise price. If Ojj is 
positive and the goods are complementary, there are two cases : * 


X X 

( 1 ) — and 0^12 "^^22 * 

XiQ X20 

Here, for the joint monopoly prices, (Pi-ai) and (Pa-«a) 
positive and both prices are above cost as represented by aj and aa. 
It may or may not be that both prices are above those of separate 
monopoly. This ease holds when, as in the majority of situations. 


* Notice that the case where and is exactly parallel 

with and adds nothing to case ( 2 ) and that it is impossible, since 
for to be greater than both — and o,, — . 
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the relation between the goods (as shown by Ou) is weak as compared 
with the direct price effects on demand (as shown by Ou and Oj,). 

(2) and ax,>aM^. 

^10 *^20 

Here, - a^) is negative while {^2 ~ ^ 2 ) positive, i.e. the good 
is sold by the joint monopolist at a price below cost aj, and certainly 
below the separate monopoly price. It pays the monopolist now to 
use one good as a “ loss leader ” in order to promote the sales of the 
second good. The conditions for the case are (roughly) that the goods 
are strongly complementary { 0^2 la-rge) and that the demand for the 
“ loss leader ” is sensitive to changes in its price (On large). For 
example, a monopoUst manufacturer of razors and blades may gain 
by marking down the price of razors (for which the demand is 
elastic) below cost to promote the sales of blades (for which the 
demand is less elastic and strongly complementary with that for 
razors).* 


14.5 Production, capital and interest. 

To generahse the problem of 10.3 above, suppose that a variable 
time elapses between the beginning of a production process and the 
final output of a good X produced with two factors A and B. The 
•amount of output x depends, according to technical conditions, on 
the amounts a and 6 of the factors used and on the length t of the 
period of production : 


a:=/(a, b, t). 

It is assumed that the prices of the factors are given as and and 
the market rate of interest as lOOr per cent, per year compounded 
continuously, all taken (for convenience) in terms of the product x. 
Further, it is taken that the factors are paid off at the beginning of 
the production. So 


y=: 


xe 


-ri 


= <f>(a,b,t) (say) 


is the product discounted to the beginning of production as a ratio 
of the cost then incurred. It is assumed that the entrepreneur is a 


• For an analysis of the “ loss leader ’* problem, see Roos, Dynamic Eco- 
nornu^ (1934), pp. 128-47. On problems of joint production, see also Stackel- 
berg, QruncUagen einer reinen Kostentheorie (1932) and Hicks, The Theory 
of Monopoly, Econometrica, 1936. 
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monopolist arranging production to maximise y. The variables to 
be chosen are a, 6 and t and we must have 


Since 
we have 


da db di 

log y=^\ogx-rt~ log -f 6p^), 

1 ^ 1 ^ „ 

yda xda + 

\dy ^\dx pf, 

ydb xdb + 

\dy _^\dx 
y di X ~di 


So 


ap^^bp^ dx 

X da ' 


^ ap^^-bp^dx ^ Idx 

Pf, = ^ and r = - -- 

X db xdt 


are the necessary conditions for a maximum value of y. These 
equations, with x=f(ay 6, t), determine the equilibrium values of 
a, 6, t and x as dependent on the given values of p^, Pb and r. 

Suppose now that there is pure competition among entrepreneurs 
so that the market rate of interest adjusts itself to make (discounted) 
receipts just cover cost. The given value of r is such that the 
maximised value of y is unity and 

xe-^*=ap^ + bpi,. 


So 


,dx , ,dx 

and pj,=ze-^*—- 


da db 

Production is extended until the discounted marginal products of the 
factors are equal to the fixed prices of the factors. 

For example, suppose that the production function is 

x = Aa^b^-H^, 

where A is constant and a and ^ are fixed positive fractions. For a 
fixed period of production and variable factors, there are constant 
returns to scale. The necessary conditions for monopoly equilibrium 
are • • 


ap^ + bpt 


= -a) 


apa + bpb 


and 


'f 


Idx a 13a: 1-a , 13a:_jS 

x da ~~ a* xdb~~ b xdt 1 


since 
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Q 

The period of production is here determined solely by the rate 

of interest and not by the prices of the factors. The first two con- 
ditions are equivalent and imply that (1 -a)apa equals a6p>, 


. CL OL Pi, 

l.e. 7=r — . 

0 1-apa 

Only the ratio of the amounts of the factors used is determined ; the 
actual scale of production is immaterial (constant returns to scale). 
The ratio, further, is fixed by the prices of the factors and not by 
the rate of interest. Notice also that 


7 ^ = = constant, 

bpi, 1 - a 

i.e. the relative share of the two factors in the product is constant, 
unaffected by changes in their prices or in the interest rate. 

Under competition, with the particular production function now 
assumed, we have the added condition 

— = constant, 

X 

i.e. the combined share of the factors in the product is constant, 
independent of their prices or the interest rate. The share of capital 
(as represented by the entrepreneurs) is then constant also. In the 
competitive case, in fact, the relative shares of the factors and of 
capital in the total product are constant, no matter how the prices 
of the factors or the interest rate change.* 


14.6 Relative maximum and minimum values. 

A different problem, that of relative maximum and minimum 
values, arises when we seek the maximum or minimum values of a 
function z=f{x, y) where the variables are not independent but 
related by some given relation if>{x, y)=0. The given relation is 
often termed a side relation. The side relation determines one 
variable (say j/) as a function of the other. Substituting, 

= (say) ' 

and our problem is effectively one of maximising or minimising a 
function of one variable. In simple cases, the side relation gives y 

* Compare the results here obtained with those given by Edelberg, An 
eeonomeirie model oj production and distribution, Econometrica, 1936. 



PROBLEMS OF MAXIMUM AND MINIMUM VALUES 365 


843 a single-valued function of x, in which case iff (x) also appears in 
single-valued form and our problem is solved without difiBculty. 

In general, the side relation is multi-valued and a more subtle 
method of attack is needed. But the essential idea is still the re- 
duction of z to dependence on one variable only. Taking x as the 
independent variable and assuming that all partial derivatives are 
finite and continuous, the function and side relation give 


dz . . 


dx 


and 




So 


dx 


Equating this expression to zero, we obtain i 


The function z=^f(Xy y) has maximum or minimum values 
relative to the side relation <f>(x, y) = 0 only at points where 


This is a necessary condition for maximum or minimum values. 
Adding the side relation, there are two equations to be solved for x 
and y. Each solution is a possible maximum or minimum situation 
and must be examined further. We find 


dx^ 




4>J'‘)dx 


~ - Vxy<l>x<f>y +fyy<f>x^) “ ^4>xy<}>x<f>v + <f>yyi>x*)} 

Yv 

on reduction. The sign of this expression at a point given as a 
solution of the necessary condition determines whether the point 
corresponds to a maximum or minimmn value of z. 

In diagrammatic terms, instead of locating the peaks and hollows 
of the surface z=f{x, y), we consider only a definite path on the sur- 
face, defined by the side relation <f>{x, y)=0, and determine where 
the path attains its highest or lowest height above the zero horizontal 
plane Oxy. Vsh^ the contour map of the surface, our path is a 
given curve crossing the contours of the map and we seek those 
points on it where the highest or lowest contours are cut. It is clear 
that, at these points, the given curve touches a contour (see Fig. 89 
below). The tangent gradient of the given curve y)=0 is then 



366 


MATHEMATICAL ANALYSIS FOR ECONOMISTS 


equal to that of the contour /(a:, j/) = constant, i.e. 



which is the necessary condition above. 



For functions of several variables, we seek the maximum or 
minimum values of y=f{Xiy ... x^) subject to a given side relation 
<f>{xiy x^y ... The side relation gives x^ as a, function of the 

other (n - 1) variables for which 


dx, dx, 

Then y is a function of (n - 1) independent variables and 






and similar expressions for the other partial derivatives. At a 
maximum or minimum position, all these partial derivatives must be 
zero. It follows that 


^x, <^*. 4>Xn 

is the necessary condition for a maximum or minimum value of t/. 
There are (n-1) equations here and these, with the side relation, 
determine certain sets of values of Xj, Xg, ... x^ amongst which are all 
those giving maximum or minimum values of y. Sufficient con- 
ditions for distinguishing maximum and minimum values are given 
later (19.2). 

In a more general problem, the maximum or minimum values of y 
are required subject to r given side relations between the variables 
(where r<n). An alternative method of approach can now be used 
to advantage.* In the case of two side relations 


<t> {Xi, X x„) = 0 and <p (x^, x„) = 0, 

write z =/(Xi, Xt, ...x„)-X(f> {Xi, x x„) - [jaI> [x^, x^, ... x„). 

Then z must have a maximum or minimum value for any values of 
A and y, whatever (since the coefficients of A and y, are zero). But 


• The method, which also applies in the simple case (r= 1) already analysed 
is known as Lagrange’s method of solution by “ undetermined ” multipliers 
See, fof example, Osgood, Advanced CalcuVua (1926), pp. 180 et seq. and de la 
VaJl^e Poussin, Coura d' analyse infiniUsimaly Vol. T (6th Ed. 1026), pp. 147-9, 



PROBLEMS OF MAXIMUM AND MINIMUM VALUES 367 

Take the two side relations as giving and X 2 as functions of the 
other (n ~ 2) variables which are independent. Then we can choose 

dz dz 

A and /x so that the expressions for — and — are zero. For a 

OX-^ 0 X 2 

maximum or minimum value of 2 as a function of the (n ~ 2) inde- 

dz dz 

pendent variables, all the partial derivatives — , ~ — , ... must be 

ox^ ox ^ 

zero. In this way, we have altogether n equations : 

/*, = 0 (r=rl, 2, 

These, together with the two side relations, determine certain values 
of A and and of the n variables Xj, X 2 y ... x^. Amongst these values 
are those which give the relative maximum or minimum values of y 
required. SuiSicient conditions cannot be written here. 

14.7 Examples of relative maximum and minimum values. 

The following examples illustrate the practical solution of relative 
maximum or minimum problems. It is important to note, in each 
case, which variables are taken as independent. 

Ex. 1. To find extreme values of z=x^ relative to 

y^ - 4x - 2y ^ 4 — 0, •• 

From /(x, t/) = X* + we have /x = 2x and fy^2y. Again, from 
4>{Xy y) +2/2 ~-4x - 21 / +4, we have ^a. = 2(x -2) and = 2(y - 1). At* 
a relative maximum or minimum value of 2 , we must have 

x-2 y-\' 

Hence, x == 2y, Substituting in the side relation, 

52/2 -lOy + 4=0. 

So y = 

The corresponding values of 2 are 

z = x2+y* = 5(l±^)*=2(3±V6). 

By examining the sign of the second derivative of 2 as a function of x, 
it can be shown that the plus sign above corresponds to a relative maxi- 
mum and the minus sign to a relative minimum value of 2 . Or, Fig. 89 
shows certain contours of the surface z=»x2+y2 (i.e. circles centred at 0) 
and the curve representing the side relation (a circle centred at x»2, 
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y — 1). It is seen that this latter circle touches two of the contours. For 
one point of contact the value of z is a relative minimum 

z = 6 - 2J6 = 1-53 approximately, 

which is the square of the radius of the smaller contour touched. For 
the other point of contact 

z = 6 + 2^6 = 10*47 approximately, 
the square of the radius of the larger circle touched. 



Ex. 2. To find the dimensions of the rectangular block of maximum 
volume which can be cut from a sphere of radius a inches. 

Let the sides of the block be 2a:, 2y and 2z inches. Simple geometrical 
considerations show that, for maximum volume, the comers of the block 
must he on the sphere’s surface with diagonals as diameters of the sphere. 
Hence, x^+y^+z^^a*. We require the maximum volume V -%xyz 
subject to this relation. 

From /(x, y, z) = Sxyz, we have = 8yz ; /^ = 8xz ; /, = 8xt/. 

From <j>(x, y, z) =x* 4.^2 .^^2^ we have = 2x ; = 2y ; = 2z. 

Hence, for the relative maximum of V, 

4yz _ 4xz _ 4x3/ , 

X y n 

It follows that X* = = z*. 


Since x* +z* =a* and x, y and z are positive, we find 
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These values clearly correspond to a (relative) maximum volume 


The block of maximum volume is a cube of side -j^ inches. 

Ex. 3. The function z =/(«, y) has a stationary value at (a, 6), so that 
=0 at this point. The relation 

y -b=m{x-a), 

where m is a fixed value, indicates a particular direction of variation of 
X and y from (a, 6). To find the nature of the stationary value of z in this 
definite direction : 

The side relation gives y as a function of the independent variable x 
and we find 

ciij 

“ = m, (from the side relation) 
dz . , du , , „ ... 


^ =fx +fy ^ =/x + mfy = 0, at (a, 6) 


/*»+/vv^Q 


fxx '^fx] 


^fxx XV Vvv 

=/.v ( m Jf') + 

\ Jvv/ Jvv 

(from the original function). • 

The sign of this last expression, for the given value of m, determines 
whether z is a maximum or minimum. 

If /pa, and fyy are both negative and such that 

fxxfvv^ fxy^ 9 

d^z 

the expression shows that ^ is negative whatever the value of m, i.e. z 

has a maximum value in aU directions from (a, 6). These are sufficient 
conditions that /(a, b) is an (unrestricted) maximum value of the function. 
If fxx fyy are both positive subject to the same condition, it follows 

similarly that f{a, b) is a minimum value of the function. 


• • 

14.8 The demand for factors of production. 

The output of a good X when amounts o and b of two factors A 
and B are used is given by the production fimction x=f{a, b). It is 
assumed that the prices of the factors are fixed at p, and pt and that 
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a given output x is required at minimum cost. We choose a and 6 
to minimise n=apt,+bpt subject to f{a, b)~x. Taking a as the 
independent variable, the side relation gives 

da // 

- dll db fa - 

is the necessary condition for minimum cost, 


i.e. 


/. /*■ 

The factors are employed in amounts such that their marginal 
products are proportional to their given prices. Further, 


dm_d (dn\_ ^ 

da^ ~~da \da ) da^ * 
d^b 

So, if > 0, then the position concerned corresponds to a minimum 

cost. The condition implies that the constant product curve at the 
point concerned is convex to the origin. The equilibrium position 
is thus stable at all points where the constant product curves are 
convex to the origin. 

The necessary condition with the side relation determines the 
demands a and b for the factors in terms of the given values of 
p 5 and X, The (minimised) cost is then determined in the same 
way. As the given output x is changed, the prices of the factors 
remaining constant, the demands for the factors vary and the cost 
of production varies to give the ordinary total cost function. 

. In diagrammatic terms, the contour map of the function 


/7=apa + 6p5 


consists of a set of parallel straight lines in the plane Oab. The 
contour for fixed cost iT. is a line with gradient equal to f - — ) and 

n n ^ \ pJ 

cutting the axes at distances — - and — - from 0. As 77, is increased 

Pa Pb . , 

the line moves away from 0 parallel to itself. If a given product x 
is to be obtained, then all possible combinations (a, b) of factors lie 
on the corresponding constant product curve. We seek that com- 
bination with minimum cost, i.e. lying on the “ cost line of tho 
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parallel set which is nearest 0. The combination is given by the 
co-ordinates of the point P where the constant product curve 
touches one of the cost lines. If the constant product curve is of 
normal form (downward sloping and convex to 0), there is only one 
such point P and it corresponds to minimum cost. At P, the 

gradient of the constant product curve ( ) and that of the cost 

f p\ . ^ 

line ( — 1 are equal. This is the necessary condition above. 

\ Pb' 

As the value of a; is changed, 
we obtain a series of different 
constant product curves and a 
series of points Pj, P 2 , ... where 
they touch cost lines 

AjPj, A.2B2i ... . 

This is shown in Fig. 90 where the 
values of x{oi^t ... ) increase. 

As z changes continuously, the 
points describe a curve in the 
plane Oab, The varying co- 
ordinates of a point on the 
curve show the variation of 

demand for the factors as the output z varies. Further, the varying • 
cost corresponding to the cost lines touched defines the total cost 
function for varying output z. 

If the production function is linear and homogeneous (constant 
returns to scale), then Euler’s Theorem gives 



So 


bpb ^^ <^/a + 

XX X * 


where A denotes the common value of Pa : fa and pj, : /j. Now 


dx 


=P, 


da 


db . ( f da db\ 


Bince 




We consider here a variable equilibrium position with output x 
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changing and and p^ fixed. Hence, average and marginal cost are 
equal at all outputs. Average cost must thus be constant irrespective 
of output (see Examples XVI, 22, below), a natural result in this 
case of constant returns to scale. In Fig. 90, the constant product 
curves are now of the same shape (radial projections of each other) 
and the locus of P is a straight line. The factors are used in the same 
proportion and their amounts and total cost increase in proportion 
as output increases. 

If the good X, produced imder constant retimis to scale, is sold on 
a competitive market at a price p equal to the constant average 
cost, then 


/. /» 



So 


Pa=Pf<, and Pi=pU 


This is the law of marginal productivity ’’ ; the price of a factor 
equals the marginal product of the factor valued at the selling price 
of the product. Suppose, further, that the demand law for X is 

doc 

a: = ^(p) with elasticity determine the amounts 

(a and b) of the factors and the selling price (p) in terms of given 
factor prices (p^ and p^), we have now three conditions : 


(1) /(a, b) = <f>{p) ; (2) Pa=pfa ; (3) Pt=pff 

The output x=f{a, b) = <f>{p) and total cost n=px=apa + bp^ are 
also determined. Full competitive equilibrium, with constant 
returns to scale, is thus determinate. 

We can trace the variation of the demands for the factors of pro- 
duction, given above in terms of p, and p^, as one of the prices 
(say Pa) changes while the other (pj) remains constant. For the 
linear homogeneous function x=f{a, b), we have 


/..= --/«» and /»=-^/a» (12.8, (4), above). 
The elasticity of substitution between the factors is 
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The equations (1), (2) and (3) hold for any values of p, and and 
can be differentiated partially with respect to p, : 

da db dp x dp 






da . dh 

+/. 

■'a 




). 


/ da . db\ 


Making use of (2), (3) and (4), we have : 

dp da db 

dp b da db xp 

3pa a dp„ dp, p. 

dp da a db ^ 

9pa 3pa b^ dp, 

dx) da 

There are thus three linear equations in the “ unknowns ”, ^ , 5 — 
db 

and ^ • By simple algebraic methods,* the values of these partial 


derivatives can be foimd and written ; 

da _ a Zap, ^ 

3p,~ p,\xp^ xp V 

and 


apa 

dp. 


ab 

xp 


Write = — — and ~ for the elasticities of demand for 

Ep, a dp, Ep, b dp, ^ 

the factors with respect to p^ and and — ^(/Ca4-/Cfc = l) 

xp xp 

for the proportions of total receipts going to the factors. Then 

Ea 

and ■^ = Ka{(r-rj). 

The interpretation of these results is clear. If the price of the 

. _ , dp , . . . 3a , 06 

• For exeunple, by eliminating , we obtam two equations m -r — and - — 

which are easily solved. Or, the method of solution by “ determinants can 
be utilised (see 18.6, Ex. 2, below). 
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factor A rises, the demands for both factors are affected, and in two 
ways. In the first place, the cost of production is increased, the 
product is dearer and (for a decreasing demand law with rj positive) 
less of the product is bought. There is then a proportional decrease 
in the demand for both factors. This is shown by the negative term 
( - K^rj) in the expressions for both elasticities of demand. Secondly, 
the factor B is now cheaper relatively to the factor A and it pays to 
substitute B for A in production as far as possible. The demand for 
B thus increases at the expense of that for .4, as is shown by the 
positive term in the expression for one elasticity and the 

negative term ( - k^o) in the other. The total effect on demand is 
found by addition. The demand for A falls in any case, but that for 
B may rise or fall according as the substitution effect is stronger or 
weaker than the effect through the demand for the product. 


14.9 The demand for consumers’ goods and for loans. 

A similar analysis applies to the demand of a consumer for two 
goods X and Y obtainable at given market prices p* Pv The 
consumer has a fixed income fi and makes his purchases to attain the 
highest position on his preference scale, as represented by an index- 
function of utility „ = ^ (a- 

We have to find z and y, the individual’s purchases, so that u is a 
maximum subject to the relation expressing the balance of the indi- 
vidual’s budget : 

This expression of the problem is independent of which particular 
form of the index-function of utility is selected ; all forms of the 
• index-fimction increase and decrease together and so attain their 
maximum values together, at the same values of x and y. 

The side relation gives y as a function of x for which 



Py * dX Py 

So 


and 

dht f dy\ f 


and 


dx* 


= 0 . 


+ f 




= ^Myy'P» - ^<f>xyPxPy + <f>xxPy*)- 
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The necessaxy condition for a relative maximum value of u is thus 

Pc, p/ 

i.e. the marginal rate of substitution between the goods in consump- 
tion (R = <f>x : <f>y) is equal to the ratio of the given market prices. 
This condition, together with the side relation, is suflScient to deter- 
mine the purchases of the individual in terms of the given values of 
Pa;, Pv and fly i.e. to determine the demand functions of the individual 
(see 11.7 above). A sufficient condition for a relative maximum 
value of is 

4^VvPa? ~ ^^xvPmVv "i* 4^xxPy^ ^ 

Using the necessary condition, this implies that 

In diagrammatic terms, the purchases of the individual are given 
by the co-ordinates of the point P in the Oxy plane where the fixed 
line with equation xpx + ypy=fiy expressing the budget balance, 
touches one of the indifference curves of the system 

<f> y) = constant. 

The point P corresponds to a maximum value of w if the indifference 
curve at P is convex to the origin. The condition of tangency is 
equivalent to the necessary condition and the condition of convexity 
to the sufficient condition written above. So, in the “ normal ’’ case 
where the indifference curves are downward sloping and convex to 
the origin, a unique and maximal pair of individual purchases are 
defined for any given market prices or given income. The “ normal ’’ 
case, in fact, corresponds to the case of stable demand functions. 
The variation of the individual’s demands for the two goods ajs one 
of the market prices or the income varies is shown by the changing 
position of the point P as the line xp^^-k-ypy—p moves in conformity 
with price or income variation. ♦ 

We can consider, as a second problem, the conditions under which 
an individual regulates the flow of his income, and hence of his con- 
sumption, ove« time. This is a problem of importance in the theory 

* For more complete accounts of the problem of demand for consumers* 
goods, see Hicks and Allen, A Reconsideration of the Theory of Value, Eco- 
nomica, 1934, and Allen and Bowley, Family Expenditure (1935). Expressions 
are here found for the elasticities of demand with respect to price or fhcoroe. 
The methods used are extensions of those first introduced by Pareto. 
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of interest. ♦ It is assumed that the individual has given resources 
which can produce various incomes £x this year and various incomes 
iy next year subject always to a given transformation relation 

F{x, y)=0. 


Interest is reckoned at the fixed rate of lOOr per cent, compounded 
yearly. We determine, first, those incomes which maximise the 
present value y 


subject to the transformation relation. We have 

1 dy dH __ 1 d^y 

dx^ l-frdx dx^~~ l^-rdx'^ 

where the derivatives of y with respect to x are given by F{x^ y) = 0. 
Hence, for maximum present value, we must have 



- 1 , 


i.e. the incomes in the two years are such that the marginal rate of 
return over cost equals the given rate of interest. The sufficient 

condition for a maximum present value < 0^ is satisfied in the 

** normal ” case where the transformation curve in the plane Oxy is 
concave to the origin (see 9.7 above). 

The necessary condition above and the transformation relation 
determine the incomes x^ and yo maximum present value, both 
incomes depending on the interest rate r. It is now assumed that 
the individual can modify his incomes by borrowing or lending 
money this year to be repaid next year with interest at the fixed 
market rate. This process leaves the present value of income un- 
changed, i.e. y 






1 -t-r ■ 1 +r 

where x and y are any incomes as modified from x^ and y^ by loans. 
For example, if the individual borrows (x - Xq) this year, then 

yo~y = (l+r)(x-.Xo) 

is the amount he must repay next year, and this is equivalent to the 


♦ Thi analysis her© given is based on that given by Fisher, The Theory of 
Interest (1930). 
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relation written above. It is assumed, further, that the loans made 
by the individual are such that the utility function 

u = 0(a:, y) 

of the individual for incomes in two years is maximised subject to 
the relation above. The particular form of the utihty function 
taken is quite immaterial. The relation gives 



-(1+r). 




The necessary condition for relative maximum utility is thus 



i.e. loans must be made so that the marginal rate of time-preference 
of the individual becomes equal to the market rate of interest. This 
condition, together with the constant present value relation, deter- 
mines the incomes x and y in terms of the given interest rate. In 
particular, (x-iCo) ^ deter- 
mined as a function of r, the 
demand function of the in- 
dividual for loans. The value 
of {x - aJo) may be positive or 
negative according to the 
interest rate, i.e. the individ- 
ual may borrow or lend this 
year to reach his “ optimum'' 
income stream. As in the 
case of the consumption goods 
problem, if the indifference 
curves, if/ {x, y) = constant, are 
convex to the origin at all 
points, then the positions obtained are stable and correspond always 
to maximum, rather than to minimum, values of utility. 

The problem can be put in a slightly different way capable of 
simple diagrammatic representation. Suppose that the individual, 
before borrowing or lending, has incomes Xq and y^ in the two jears. 
Represent these incomes by a point in the plane Oxy (Fig. 91). 
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By borrowing or lending, the individual can modify his incomes to 
X and y where the point (x, y) must lie on the line A'B' with equation 



i.e. the line through with fixed gradient (to Ox) which is negative 
and numerically equal to (1+r). The incomes selected by the 
individual are given by the co-ordinates of the point P' where A'B' 
touches an indiflference curve of the system 


iff {Xy y) = constant. 

If the indifiFerence curves are everywhere convex to the origin, then 
P' is uniquely determined and corresponds to a maximum of utility. 
The initial incomes x^ and y^ can be varied according to the given 
transformation relation F{Xy y)=0 and the point can take any 
position on the corresponding transformation curve QR. As Pq 
varies, so does the point P' and we require that position of P' giving 
the largest possible utility level. The position required is given by 
P where an indifference curve is touched by the hne AB (parallel to 
A'B') which touches the transformation curve at some point Pq. 
In other words, we seek a line with fixed gradient ~ (1 +r) which 
touches the transformation curve at Pq and an indifference curve 
at P. The incomes first derived by the individual are given by the 
co-ordinates of Pq and the final incomes, as modified by loans, are 
.given by the co-ordinates of P. In this position, the numerical 
gradient of AB (I +r) is equal to that of the transformation curve 
(P, : Fy) and to that of the indifference curve (iff^ : ifjy), i.e. r equals 
the marginal rate of return over cost and the marginal rate of time- 
preference. Notice that, for a line with fixed negative gradient 
-(l-hr), the present value of the incomes represented is constant. 
The maximum value of this present value attainable is given by the 
particular line AB, Hence, as in the previous analysis, we can first 
locate Pq for maximum present value of incomes on the transfor- 
mation curve, and then locate P for maximum utility of incomes 
obtainable by borrowing or lending from those at Pq. 


EXAMPLES XIV . , 

General rnaxiimim and minimum prohlema 

1, Indicate the contour, ridge and trough lines of each of the functions 
zsTX*-^y*-l; Show that x^y = 0 gives a 

stationary value of z in each case. Determine the nature of the stationary 
value in the various cases. 
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2. Show that z = oa;* -l- 63/* + c always has a single stationary value which 
is a minimum, a maximum or a saddle point according as a and h are positive, 
negative or of opposite signs. 

3. Show that z = a;* + aji/ + 1/* has a minimum value and z = a;* + a^y - 1/* a 
saddle point at a: = y = 0. 

4. Show that z = oar* 4- 2hxy + hy^ -I- 2gx + 2fy 4- c has two straight ridge or 
trough lines and deduce that the fimction has only one stationary value. 

6. If z = (a; - show that there are ridge and trough lines which are 

straight and parallel and that z has no stationary value. 

6. Show that z = (a;* + t/*)e**~y* has a minimum value at x = t/ = 0 and 
saddle points where a; = 0, y= dbL 

7. Indicate the form of the contour map of z = a;* 4- y* - Zxy and show that 
the ridge and trough linos are arcs of parabolas. Deduce that z has a minimum 
value at a; = y = 1 and a saddle point at a; = y = 0. 

8. Locate the stationary values of z = 2(a; ~ y)> - x* - y^. 

9. If z is a fimction of x and y defined by x* + y* 4- 3z* - 2x 4- 2z = 0, show 
that the contours are circles centred at x= 1, y = 0 which expand as z increases 
( - l<z< - I) and then contract as z increases further ( “i<z<a). Deduce 
that z= - 1 is the minimum and z = J the maximum value of z. (See 14.3, 
Ex. 4 above.) 

10. Show that the only stationary points on the surface z = x*e*' form a line 
of minima lying along the axis Oy. Verify by examining the nature of the 
sections of the surface by planes perpendicular to Oy. 

0Z 0Z 

11. If -- = (ox + 6y 4- c) (x, y) and — = (ox 4- 6y 4- c) «/f (x, y), show that z iAs 

a line of stationary values corresponding to ox -f 6y 4- c = 0 on the contour map 
in the plane Oxy. Illustrate with z = x*ev. • 

12. If u~f(z) where z~ fj>{x^ y), show that the stationary values of m and z 
occur at the same values of x and y. If/'(z)>0 always, deduce that corre- 
sponding stationary values of u and z are also of the same type. 

13. The variable p is defined as a function of four variables o, 6, t and t by 

the relation ale^^ -^-bref" =f(af 6, t), where I and r are constants and / is a 

given function. Find four equations to determine a, 6, t and r for maximum p, 
(See Wicksell, Lectures on Political Economy^ Vol. I (English Ed. 1934), p. 181.) 

14. By eliminating y, show that z = e^4i^ has a minimum value e* relative 
to X - y 4- 2 = 0. Check by means of the results of 14.6. 

15. If X and y are positive, show that the maximum value of xy subject to 

X* 4- y* = o* occurs where x = y = . Illustrate by drawing the contour map 

n/2 

of z = xy and the circle x* 4- y * = 4. 

16. Tlie variJbleS x and y are related “4-^—1, where a>h. Show that 

^ a* o* 

(x*4-y*) has then a maximum value a and a minimum value 6. Illustrate 
diagrammatical ly when a = 5 and 6 = 3. 

17. Establish the necessary condition for an extreme value of z=/(x, y) 
relative to ^(x, y) = 0 by Lagrange’s method, using an ** undeterftiined ” 
multiplier A. 
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18. Find the stationary values of 2 = + subject to y = mx where 

m is a constant. For what values of m is z (a) a maximum and (6) a mini- 
mum at x = y = 07 Deduce that x=y^0 is a saddle point of the surface 
* = X* + y* - 3xy (see Excunple 7 above). 

19. A rectangular brick has a given volume ; show that its surface area is 
least when it is a cube. Show that the same result holds if the brick has a 
given surface area and maximum volume. 

20. An open box of given volume 4 cubic feet has a rectangular base and 
vertical sides. It is made of wood costing 1 shilling per sq. foot. Show that, 
for least cost of construction, the base of the box is a square of side double 
the depth. What is the least cost? 


21. An ** ellipsoidal ” block of wood has a surface with equation 

re* y* ** , 

— ^ + - 1=1 

a* 6* c* 

Scibc 

referred to axes Oxyz in space. Show that ^ is the volume of the largest 

3 kjo 

rectangular brick that can be cut from the wood. 


22. If axes Oxyz are fixed in space, what is the distance of the point (x, y, z) 
from O ? Express the distance from O to any point on the plane 3x + 4i/ + 6z = 10 
in terms of the x and y co-ordinates of the point. Deduce that s/2 is the 
shortest and perpendicular distance from O to the plane. 

23. Given z = m/(a;) + n^(y) where ma; + ny = a and m + n = 6, find z as a 
function of x and y for given values of a and 6. Deduce that z has extreme 
values for variation of x and y where 

rf/ y 

(See Wicksell, op. ctf., p. 140.) 


Economic maximum and minimum problems 

24. A monopolist firm produces two chocolate lines and X, at constant 
average costs of 2s. 6d. and 3s. per Ib. respectively. If p^ and p, are the prices 
charged (in shilling per lb.), the market demands are 

iCi = 6(p,-Pi) and a:, = 32 + fip^ - lOp, 

thousemd lbs. per week of the two lines. For maximum joint monopoly 
revenue, show that the prices are fixed at nearly 4s. 6d. and 4s. 9d. per lb. 
respectively. 

25. In the problem of the previous example, find the prices of the two fines 
fixed by independent monopolists and show that they are less them the joint 
monopoly prices. 

26. A monopolist produces cheap razors and blades at* a constant average 
cost of 2s. per razor and Is. per dozen blades. The demand of the market 

per week is jt, = thouseind razors and a?, = ~ — thousand dozen blades 
PiPt . PiPt 

when the prices are pj (shillings per razor) and p, (shillings per dozen blades). 
Show that the monopoly prices, fixed jointly, are 4s. per razor and 28. per 
dozen blades. 
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87> If the demands of the market in the previous example are 
*, = ll-2p,-2p, and *,=¥- 2p, - 

show that the joint monopolist would give the razors away and sell the blades 
at a high price. 


28. The market demand law for a good X is p = j5 - ax, A monopolist 
produces X at average cost {ax -f- b) for output x and sells to a merchant at a 
price ir which maximises his profits. The merchant is a monopolist with 
constant distributive costs and maximises his profits by selling on the market 

B — b 

at price p. Show that the amount of X produced and sold is x=-^ — 

^ ^ ^ 2(a + 2a) 

and that 7r = - 2(xx and p = j5 - oar. Find the output if the producer mono- 
polist sold direct to the market and show that “ bilateral monopoly ** here 
restricts output and raises price. 


29. Generalise the problem of the previous example by taking a market 
demand law p= and producer’s average cost /(x). Show that 

<f> (x) + 3x</>'(x) + x*<f,"{x) =/(x) + xf'{x) 

gives the output x and that 7r= <l>{x) + x<f>' (x). (See Bowley, Bilateral Mono- 
poly. Economic Journal, 1928.) 

30. In the problem of 14.5, assume that the product after t years represents 
the original cost accumulated at some continuous interest rate of lOOp per 
cent. Given the factor prices, express p as a function of o, 6 and t. The 
monopolist is assumed to arrange production to maximise p ; find equations 
for a, 6 and /. If the maximum equals the market rate of interest r, show that 
the result is equivalent to that of the competitive case of 14.5. 

31. If the production function of 14.5 is show that •the 

equilibrium period of production is where r is the market rate of 

interest. In the competitive case, show that r is given by 


- = 161 og 2 s^^. 

32. If x = A >/a6 is the production function, find the amounts of the factors 
used at given prices and Pf, to produce an output x at smallest cost. In the 
case of pure competition on the market for X with the demand law a; = jS - ap, 
show that the demands for the factors are 


a 


P jPb 
aSp^ A^ 


and b 


I 

aSp^ a- * 


33. For any production function, if each output x is obtained at minimum 
cost at given factor prices, show that the common equilibrium value of 

^ and ^ is equal to the marginal cost of output x. If the firm fixes output 

fa fh 

to maximise pr-ifits^at a given selling price p, show that the marginal produc- 
tivity law p^ — pf^ and p^ = holds. Why need not total cost equal total 
receipts in this case? 


34. A good X is produced with two “ substitutional ” factors A and B and 
a “ limitational ” factor G according to the relations x—f {a, b) = i^{c). If 
each output x is produced at minimum cost at given factor prices p^,, p^ and p^. 
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show that the amounts of the factors used and the total cost are determined 
as functions of x and deduce that 

p, 

fa fb ^ ^'(c) " 

The selling price p is fixed and output chosen for maximum profits. Show that 

In what sense can the factor prices be said to be equal to the “ marginal 
revenues** of the factors? (See Georgescu, Fixed coefficients of production 
*ind marginal productivity theory ^ Review of Economic Studies, 1935.) 

If u = is an individuars utility function for two goods, show that 

his demands for the goods are a? = — and t/ = — — i — where and 

a -f j8 ^ 

are the fixed prices and p the individual’s fixed income. Deduce that the 
elasticity of demand for either good with respect to income or to its price is 
equal to unity. 

An individual’s utility function is li = 2gx + 2fy - ax* - 2hxy - by*. Show 
that his demands for the goods are linear in the income p, the coefficients 
depending on the prices of the goods. 


37. Differentiate the equilibriiun conditions giving the demands for two 
goods X and Y for an individual with utility function y) in the case 

where <l>j^ = 0. Show that 


dx 

Wx 


1 

^4>xx 




where 


m^P^.Py* 

<t‘xx ^wt 


and deduce that the demand for a good decreases as its price increases provided 
that and <l>y^ are negative. Why is this statement of the problem not 
independent of the particular utility function selected T (See Pareto, Manuel 
d'iconomie politique, 2nd Ed. 1927, pp. 579-84.) 


38. The incomes of an individual in two years are x© and y^ and his utility 
function for incomes is w = x^y^. Show that the demand (x - x©) for loans this 
year decreases as the given market rate of interest lOOr per cent, increases. 
Deduce that the individual will not borrow this year at any (positive) rate of 

interest if yo<-Xo. 

a 


39. In a self-contained community, an individual can produce, with his 
given resources, amounts of goods X, Y, Z, ... according to the transformation 
function F{x, y, z, ...) = 0 and he can then buy or sell on a market at fixed 
prices p^, Py, p^, .... His utility function is w= <f>(x, y, z, ...). How are the 
amounts he demands or offers determined? Illustrate diagranamatically in 
the case of two goods and show that the solution is exactly analogous to that 
of the problem of the demand for loans (14.9 above). •* 


40. Values (x^) of a variable quantity X are given for a series of (2n-f 1) 
years 

^ <= -n, - (n- 1), ... - 1, 0, 1, ... (n- 1), n 

taking f = 0 at the centre of the period. Write x^^at-^b + where a cmd b 
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are to be determined so that the sum of all observed values of is a minimum^ 
Show that 


a 


Ut* 


and 


b = 


Ex^ 
2n+ 1 


= a; 


where E implies summation over the (2n -f 1 ) observed values. In what sense 
does a: = o< + 6 represent the “ trend ** of the series of A? 


41, The Board of Trade Wholesale Price Index -number (1924= 100) at 

certain dates is given by the following table : 


Date 

1 


July 1929 . 

-3 

82-7 

Jan. 1930 - 

-2 

78.8 

July 1930 - - - 

- 1 

71-7 

Jan. 1931 - . - 

0 


July 1931 - 

1 

61-6 

Jan. 1932 - 

2 

63-4 

July 1932 - - - 

3 

68-8 


Find the trend x=^at-\-h by the method of the previous example. Compute 
the trend values for each date and compare with the actual value of the index. 
Represent the original values and the trend values graphically. 









CHAPTER XV 

INTEGRALS OP FUNCTIONS OP ONE VARIABLE 


15.1 The definition of a definite integral. 

The concept of an “ integral ** has two very different characteristics 
and two correspondingly distinct applications. From one point of 
view, an integral is the limiting value of a certain summation ex- 
pression which is of frequent appearance in mathematical analysis 
and which corresponds, in diagrammatic terms, to an area enclosed 
.by a plane curve or curves. The integral, viewed in this way, is 
called a ‘‘ definite ” integral. From another point of view, an 
integral is the result of reversing the process of differentiation. The 
derivative of a function of one variable is itself a function of the 
8h*me variable. The inverse problem is to obtain, from a given 
fimction, a second function which has the first as its derivative. The 
second function, if it can be found, is called the “ indefinite ” integral 
of the first. 

It is essential, in the following development, to distinguish care- 
fully between the two aspects of an integral, the “ definite ” and 
the ‘‘ indefinite ’’ aspects. The actual definition of an integral can 
be framed in one of two ways and, once one aspect is adopted as the 
basis of the definition, it is then our business to establish the property 
implicit in the second aspect. It is found that the sum-area concept 
of an integral provides the better starting-point. The definition 
given below, therefore, makes use of this concept and it is only at a 
later stage that the integral is connected with the inverse differentia- 
tion process. This is merely a matter of formal pregientation ; the 
sum-area and inverse-differentiation properties of integrals are 
equally important in appfication. 

It is assumed that y=zf(z) is a single- valued function which is 
continuous for all values of x in the given interval from a;=a to 
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z=b* The interval of x, of length (b - a), is divided in any way we 
please into n sections by means of the dividing points : 

^ 2 > ^ 8 > ••• f 

We form the sum : 

/(^) (x% ~Xi) +f(^ 2 ) (*8 -**) + •.. +/(a:„-i) («„ - ®„_i) +/(x„) - x„), 

where each term is obtained from a different section and consists of 
the length of the section times the value of the function at the lower 
(or left-hand) point of the section. For convenience, a special 
notation is adopted in which we write the sum as 

n 

2 fi^r) (^r+l ■” ^r)> 

f-1 

where the term written is the typical (rth) term of the sum and the 

n 

symbol 2 indicates that all such terms, from the first (r = l) to the 

r— 1 

nth (r = n), are to be added together. f The number n of the sections 
into which the given interval is divided is now increased in any way 
such that each section becomes smaller. It can be shown that the 
sum above then increases and approaches a definite limiting value. 
No formal proof of this basic result is offered here but a diagram- 
matic indication of its correctness is given below. The limiting 
value, approached as n tends to infinity, is called the definite integral 
of the function between the lower limit a and the upper limit 6 and 

written in the symbolic form f f{x)dx. So : 

^ a 

rb n 

Definition: /(x)dx=Lim 2 /(a^r) (a^r+i - *.)• 

Jq n— ♦•00 r=l 

The process of finding the integral of a function is called integration. 
From the definition, it is clear that the value of a definite integral 
is simply a number which can bo positive or negative and which 
depends only on the form of the function and on the values of the 

♦ The rigid definition of a definite integral does not assume that the fimction 
is continuous. The full development, following Riemann (after whom the 
integral is sometimes named), is somewhat involved — see Goursat, Cours 
(Tanalyae rmUhimaiiqut, Vol. I (4th Ed. 1924), pp. 169 et seq. The essentials 
of the definition, however, €ure clearly indicated here. 

t This 2 notation is found to be of great convenience in dealing with sums 
of all kinds. The Greek capital 2» ^ alternative to the English is an 
obvious letter to indicate a sum. 
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limits (a and 6) taken. The variable x is not involved ; it has been 

integrated out 

It can be shown, further, that sums other than that written above 
have exactly the same limiting value, the definite integral, as n tends 
to infinity. Such sums are formed by taking, for each section, the 
length of the section times the value of the function at the upper 
(or right-hand) point of the section, or indeed at any point within 
the section. This fact, again not formally proved, is supported by 
the diagrammatic illustrations below. Hence : 

rb n 

fix) dx = Lim E /«) (*r+i - ®f), 

•'a n-><» r— 1 

where can take any value from x^ to x^^^ inclusive. 

Put more broadly, the definition is as follows. The interval from 
a;=atoa:=6is divided into a number of sections of which Ax is a 
typical one with x as one point contained in it. Then 

f /(a:)da;=LimS/(a:) Jaj, 

o 

where each section length Ax is multiplied by the value of the 
function f{x) at a point of the section, the whole set of products is 
summed and the limit found as the number of sections into which 
the given range (a, 6) is divided is increased indefinitely.* 

• The following properties of definite integrals are derived at once 
from the definition. If f(x) and <f>{x) are single-valued functions 
continuous over the relevant intervals, then i 

(1) I fix)dx- f(x)dx. 

(2) \ {-fix)}dx=: -{ fix)dx. 

(3) f kf{x)dx = k{ f{x)dx (A: being a constant). 

•'a •'a 

(4) { fix)dx={ fix)dx+{ fix)dx. 

J Q J Q J m ^ 

* The way in which the integral notation has arisen is clear from this rough 
statement of the definition. In the limit, the sum is written Sf(x)dx, 

S being an alternative symbol for 2* The S then becomes elongated into the 

oonvenAonal sign [• 
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(®) I f{x)dx+j ^(x)dx. 

The last of these properties can be considered as an illustration of 
the formal method of proof. We have 

2{/(^r) + ^ (^r)} (^f-n “ ^f) == 2 {/(^r) (^r+l “ ^r) + ^ (^r) (^r+l “ ^r)} 
r-1 f“l 

n n 

= 2 f{Xr) {Xr+i -Xr)+'2<l> (Xr) {Xr^i “ X^), 

i.e. I {f(x)-h(f>(x)}dx=\ f(x)dx-\-\ (f>{x)dx 

J a J a J a 


on taking the limit as n tends to infinity and using the property 
(4.5) that the limit of a sum is the sum of the separate limits. It 
can be noticed that this result extends to the integral of a difierence 
and, in an obvious way, to the integral of sums or dififerences of any 
number of separate functions. 


15.2 Definite integrals as areas. 

The diagrammatic representation of a definite integral which 
follows makes the concept much more clear and, at the same time, 
leads to important applications. Suppose, first, that b>a and that 



cl A CD B X 

Fia. 92. 


the continuous^ fujiction f(x) assumes only positive values between 
x^a and x = 6. Further, for simplicity of exposition, take the 
function as monotonic increasing. The function is then represented 
(Fig. 92) as a continuous curve lying above the horizontal axis Ox and 
rising from left to right between the ordinates AP (at x=a) and 
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BQ (at a: =6). Let CD be a typical member (*„ ar^+i) of the n 
sections into which we divide the range AB (n = 8 in the figure). The 

n 

contribution of this section to the sum 2 / (a^r) (»r+i - *r) is CD times 

f— 1 

CR, the ordinate of the curve at C, i.e. it is the area of the rectangle 
with CD as base and GR as height. The whole sum is thus the sum 
of the rectangular areas shaded in Fig. 92, an irregular area lying 
entirely under the curve between P and Q. As the number of 
sections is increased, each section becoming smaller, the shaded 
area, comprising an increasing number of rectangles becoming steadily 
thinner, must increase and approach a limiting value which we can 
identify as the area under the curve, above Ox and between the 
ordinates AP and BQ. Hence, the sum written above has this 
area as limit and 

f f(x)dx = 8bTeSi. ABQP under the curve y=^f{x). 

^ a 

It is also clear that the same result follows if we start from one of 
the other sums used in the definition of the integral. The sum 

fi 

2 ^ represented by a sum of rectangle areas, a 

r— 1 

typical rectangle having CD as base and DS as height. We have 
now an irregular area enclosing the curve and, as the number of 
sections is increased, the area decreases and approaches a limiting 
value which is the area ABQP under the curve. Further, the sum 

2 where x^ is any value between and x^^^^ is 

f— 1 

shown as the sum of rectangle areas, a typical rectangle having base 
CD and height equal to some ordinate of the curve abov** CD. This 
area again approaches the area ABQP under the curve as the number 
of sections increases. It does not matter which of the sums is taken ; 
the limit is always the definite integral shown by the area ABQP 
under the curve.^ 

• It can be noticed that we have actually given here a definition of the area 
under a curve. The only areas known to elementary geometry are those of 
rectangles. But, if we fit to a given ciirvilinear area a set oiT rectangles in one 
of the ways indicated, and if the sum of the rectangle areas tends to a limit as 
the numl^r of the rectangles increases, then the limit can be defined as the 
curvilinear area. Compare this definition with the graphical method of esti- 
mating a curvilinear area in practice, by coimting the number of small squares 
of the graph paper covered by the eurea. 
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A similar argument applies when the function, though positive, 
is not monotonio increasing between x=a and x=6. The integral 

f f{x)dx can always be represented as the area under the curve 

a 

y=f{x), above the axis Ox and between the ordinates at x=a and 
a; =6. This is illustrated in Fig. 93. 




When the function f{x) assumes negative values between x=a 
and x=b{b>a)y complications arise which can only be overcome by 
making conventional distinctions between positive and negative areas. 
lSf(Xr) is negative, then the contribution of the section (x^., to 

n 

the sum 2 /(^r) (^^r+i - ^r) is negative and shown by a rectangle below* 

the axis Ox. It follows, when the limit is taken, that the integral 

[ is stiU represented by an area between the curve y—f(x)^ 

the axis Ox and the ordinates at a;=a and a; =6, provided that any 
parts of the area above the axis Ox are taken as positive and any 
parts below the axis Ox as negative (see Fig. 94). The inte^al and 
area as a whole may, of course, be positive or negative.* 

So, if the function y=zf{x) is positive in the interval (a, b) and the 

curve above Ox^ then the integral f f{x)dx is positive and measured 

^ a 

» • . 

* It must be emphasised that the integral \ J{x)dx is here not the sum, but 

' o 

the difference, of the numerical areas PAG and QBG of Fig. 94. If the 
numerical sum is required, the areas PAG and QBG must be obtained separ- 

( 0 fb • 

f{x)dx and - 1 f(x)dx, where OG^o. 
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by an area under the curve. If the function is negative in the 
interval and the curve below Ox^ then the integral is negative and 
numerically equal to the area below Ox and above the curve. If the 
function changes sign in the interval and the curve crosses the axis 
Ox^ then the integral may be positive or negative and is represented 
by the algebraic sum of a positive area (above Ox) and a negative 
area (below Ox), 

A further convention regarding the sign of an area is needed to 
interpret the integral f f(x)dx as an area in the cases where a>b, 

J a 

Each of the terms (ar^^i - x^) appearing in any of the sums defining 
the integral is now negative instead of positive and the corresponding 
rectangle areas are described from right to left instead of from left 
to right. We simply need the convention, therefore, that an area 
described from right to left is numerically equal, but opposite in 
sign, to the similar area described from left to right. 

In all cases, the integral f f{x)dx is measured by the area be- 

^ a 

tween the curve y=f{x)y the axis Ox and the ordinates of the curve 
a,t x=a and x=b. Various parts of the area must be considered as 
taking signs according to the conventions here indicated. The area 
as a whole is the algebraic (and not the numerical) sum of the 
separate parts and it may be positive or it may be negative. 


15.3 Indefinite integrals and inverse differentiation. 

In introducing the second aspect of integrals (as inverse to 
derivatives), we can quote a simple example of the kind of problem 
involved. The variable y is known to be a single-valued fimction 
of X suoh^that, for all values of x, we have 


dx 


= 1 +a;* 


This is a “ differential equation ” involving the derivative ^ of a 
C ' * 

fimction so far yn^^^wn. Is it possible to deduce the actual form 
of this unkijpvfn function from the equation? We know that 


dx 


{x)^l 


and 



So 
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^(r+Jx3) = l+X*. 

One possible form of the function we seek is thus 


t/=a: + |x®. 

This is obviously not unique since the addition of any constant to y 
leaves the derivative unaltered. So, if c is any constant whatever, 

y=x-\-\x^ + c 

is a form of the function required. It is also fairly clear that we can 
expect no other form of y to have the derivative at all 

points. In a particular case such as this, it is possible to reverse the 
differentiation process, to solve a “ differential equation of the 
above type. Notice that the additive constant, which disappears on 
differentiation, reappears in the reverse process. We have now to 
generalise this step and connect it with integration. 

The definite integral f f{x)dx depends for its value on the form 

J a 

of the function f{x) and on the values of the limits a and 6. If the 
form of the function and the lower limit a are given, take a variable 


number x as the upper limit and write the integral j f{x)dx. The 

^ o 

• • 

value of the integral now depends on the value allotted to x, i.e. the 
integral is itself a function of x. It is called the indefinite integral of 

the function f{x) and written simply J f{x)dx. So : 

Definition : j/(x)dx=| f{x)dx = a. function of x. 


By indefinite integration, we obtain, from a given function of x, a 
second function of the same variable. In diagrammatic terms, the 
indefinite integral is a variable area, i.e. the area between the curve 
y=f{x)y the axis Ox, the fixed ordinate at x=a and a variable 
ordinate at x. 

The fundamental result connecting differentiation and integration 
can now be stated : 


If/(x) is a continuous function, then the derivative of the indefimte 
integral of f{x) is the function f{x) itself : 
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Without giving a completely rigorous proof, the correctness of this 
result can be seen in diagrammatic terms. Let F(x) denote the 
indefinite integral of the continuous function f(x). Then 

rx+» tz 

+ -i^(x)=J f{x)dx-^ f{x)dx=j f{x)dx 

by property (4) of 16.1. The last integral written is shown by the 
area between the curve y=/(a:), the axis Ox and ordinates at x and 
(x^-h). If A is small, the area differs by a small amount from that 
of the rectangle with base h and height /(a:), the ordinate of the 
curve at x. So, if A is small, 

F{x-\-h)-F {x) =hf{x) approximately, 
i.e. — - =/(^) approximately. 


It follows that, as A-^0, the ratio on the left-hand side tends tof{x), 
i.e. the derivative of F (x) is f(x ). ' q.b.d. 

The fundamental result can be put in a more practical form. If 
a function <f)(x) can be found with j{x) as its derivative, then 

and ^f{x)dx and <}>{x) must be identical except for an additive and 
arbitrary constant : 

\f{x) dx = ^ (a:) + constant. . 


The integral here is between limits a and x. Putting x=a, 


I f(x)dx = <f> (a) -f constant. 

J a 


The integral now is zero, representing an area on a base of zero 
length. It follows that 

constant = -r ^(a), 

i.e. the arbitrary constant of the indefinite integraLcaj be written in 
terms of the arbitrary and constant lower limit of integration. Put 
x^b in the indefinite integral and substitute for the constant. So 

f /(a:) da: = ^ (6)4- constant = ^ (6) (a). 
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The practical rule for integration is thus : 

It fix) is a continuous function and H ~ {<j>ix))=fix), th«i 

CLX 


^fix)dx = <i>{x) -I- constant 
[ f{x)dx = <f>ib)-4>{a). 

J a 


To integrate a given function f(x), it is only necessary to find, by 
some means or other, a function wiihf{x) as its derivative. 
The fundamental result can be put in yet a third way. If it is 

given that the variable y is a function of x such that 
where /(a;) is a given continuous function, then 

and y and ^f{x)dx can differ only by a constant. Hence : 


The solution of the differential equation — =/ (x) is 

Q/U/ 


f=^f{x)dx 


+ constant. 


We have thus established the fact that integration is inverse 
differentiation. If the function f{x) is obtained from <f>{x) by deriva- 
tion, then the function (f){x) is obtained from f{z) by (indefinite) 
integration. The only difficulty is that an arbitrary constant re- 
appears in the integration process. The same fact is expressed by 
saying that the integral off{x) provides the solution of the differential 


equation 


dx 


=f(x). 


The concept of a differential equation wffl be 


greatly extended in the following chapter and the idea of integration 
as inverse differentiation will be generalised to the idea of the 
integration of differential equations of all kinds. 


15.4 The technique of integration. 

The practical method of evaluating a definite or indefinite integral 
depends, as we have just seen, on finding a fimction with the function 
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to be integrated as its derivative. If ^ then we can 

write at once 

jf{^) dx = <l> (x) + constant ; J f(x) dx = <l){b) - <f> (a). 

In attempting to systematise the practical method on lines similar 
to those that proved successful in the case of derivation, we first 
obtain and table some standard forms. From the derivative results, 


we have 



-( 
dx ' 

1 =x^ 

vn + l/ 

' (n^-1); ^(loga;) 

Reversing these results, we have the three i 
integrals i 


(1) 

r 7 ^ . 

\ x^dx = + constant 

1 71 -h 1 


(2) J 

\-dx = log X + constant. 


(3) J 

|" 6* (fa; = c* -h constant. 


Furtlier, i{f{x) is any continuous function of x, then 



A 

dx 




(w - 1) ; 


= |^(e«)=/-(x)e/<->. 

^he standard forms for integrals can thus be generalised : 

(!') {/(^)}" dx = h constant 1). 

(2') J dx = log f{x) + constant. 

(3') dx = cl + constant. 


Thp second step is to frame, if possible, a set of rules for the 
integration of combinations of functions, functions of functions and 
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80 on. Two such rules can be written down at once from properties 
(3) and (6) of 16.1. For indefinite integrals, we have 

^kf{x)dx=^k^f{x) dx 

and J {fi(x) +fi{x)} dx=^f^{x)dx+jfj{x)dx. 

By an obvious extension and combination of these results, we have 

J + ••>}dx=<iy^f^{x)dx + at ^U(x)dx + ... . 

There is no difficulty in integrating an expression consisting of sums 
or differences of simple functions. But we can proceed no further ; 
any attempt to frame other rules of general applicability, for the 
integration of products, quotients or functions of functions, is 
doomed to failure. The practical process of integration can be made 
neither as systematic nor as complete as in the case of derivation. 
In integrating, we are driven back to tentative “ trial and error 
methods, aided by a few more general devices (such as that of 
“ integration by parts ”). In each case, the most important point 
to remember is that any suggested form <l>[x) of the integral of f(x) 

can always be checked by derivation: 

following illustrative examples, the additive constant that appeai^g 
in all indefinite integrals is omitted for convenience of writing. 

Ex. 1. ^ Jxdx=-^x^ by (1) above. 

Ex. 2. + 3a; + 2) dx = - + 1 + 2a; 

* r 11 

and I (ax^ + 6x + c) dx = - ax^ + ^ + cx (in general). 

C If 1 (ct/X "f" 

and (ax + 6)" dx = - a (ax +b)”dx=- — (in general). 

J « • aj a n + 1 

Ex. 4. J^ = log(l+x), by (2'), 

f-^^^=-log(ax + 6) (in general). 

Jaa:+6 aJox + 6 a ° 
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* (2aa: + b) dx 
ax^ -\-bx+c 

•x^-f3x-2 , 


= log {ax^ + 6a; + c) (in general). 


Ex. 6. J*“~~ — ^ + 

Ex. 7. ^e^dx^^^2e^dx^^e^, by (3'), 
and j^ax+b dx=^ Ja6®®+^ ~ i (in general). 

Ex. 8. 1*^6“®* ^ da; = - i e“**, by (3'). 

Ex. 9. ^x^e-^dx. 

Since ~(a;e“-**) =«”** 4-a;( -2xe-**) = (1 -2a:*)e“**, 

80 a;e~**=J'(l -2x*)e~**da; = Je“**dx -2jx^e“**da;, 

i.e. jxh -^ ’ 

I’he integral though it is known to exist, cannot be evaluated 

in terms of ordinary functions. We have, therefore, reduced the given 
integral to as simple a form as possible. 

15.5 Definite integrals and approximate integration. 

The integral of a function f{x) between given limits is evaluated 
once the indefinite integral ^{x) is foimd as a function of x. The 

formula J f{x) dx = (f){b) - (f) (a) suffices. For example : 

Areas under plane curves can then be evaluated at once. The para- 
bola with equation y=x^ + Sx^2 lies above Ox for positive values of 
X, The area under the curve between ordinates at ’a; =• 0 and a; = 3 is 


I* (x* + 3a; + 2)da;=|^ 


x* + -x2 + 2a;J 


-[i»* + |»> + 2*] =^ = 28-5. 



INTEGRALS OF FUNCTIONS OF ONE VARIABLE 397 


have noted, however, that the indefinite integralfl of some 
funotions, even of quite simple form, are often extremely difficult, 
if not impossible, to evaluate.* If the definite integral of such 
a function is required, for example, to give the area under the cor- 
responding curve, it is necessary to adopt methods other than that 
described, A number of formulae giving the approximate values of 
definite integrals have been devised and are of service in the present 
problem. All these formulae depend on replacing the area under the 
actual curve considered by that under a simpler curve which approxi- 
mates to the actual curve. 

Suppose that the function f{x) is continuous and positive in the 
interval (a, b) and that we require the value of [ f(x)dx. The 


interval (a, b) is divided into some even number (2n) of parts of equal 


length A = 


b -a 


This is shown, m Fig. 96, by the points A^, Ag, 



^ 2 n+i on the axis Ox, where OA^=a and OA^n+i^b and 
AiA^^A 2 A^ = ... =h. The ordinates Vi^A^P^, t/a=^ 2 ^ 2 > ••• t)® 

read off the curve The integral f f(x)dx is the total area 

^ a 

* It is not possible, for example, to find dx in terms of ordinary func- 
tions, The integral Exists and must be a function of x. But the function is 
not of algebraic form ; it is, in fact, a “ transcendental ** function which can 

only be defined as taking its values from | c”*** dx for various values of x. It 

is a frequent device, in mathematical analysis, to define new functionf^ as the 
integrals of other functions. 
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under the curve and on the base AiA^n+n i ©- the sum of the areas 
under the curve and on the bases AiA^y -4 2 ^ 3, ... . 

As a first approximation, the area under the curve on the base 
AiA^ can be taken as the area of the trapezium A^PiP^A ^ : 

P(yi+y2)- 

Similarly, the approximate areas under the curve on the other bases 

iA(2/a + !/8), IHys + yih-- 

The approximate value of the total area, i.e. of the integral required, 
is foimd by addition. Hence, 

f f(z) dx = + t/ 2 n+l) 4- 2 (yj + !/3 + • • • + J/ 2 n)} 

^ • 

approximately, the trapezoidal rule for approximate integration. 

A closer approximation to the area under the curve on the base 
A^Af^A^ is given by the area under the arc of the parabola (with axis 
vertical) passing through the points Pa and P,. To calculate 
this latter area, let the equation of the parabola be 

y^oac^+^x + y 

when the origin for x is taken, for convenience, at A^. (This step 
does not affect the value of the area required.) The curve passes 
through Pi( - A, yi), Pa(0, y*) and Pg(h, y,). Hence, 

yi = (xA*-)3A + y, ya=y. y3 = aA*+/3A+y. 

Solving for a, )3 and y, we find 


yi-^Vi+Va o_y»-yi 

2h^ ’ 2h 


and y=ya. 


The area under the parabola on the base A^AiA^ is 
f {(xx*+Px+Y)dx = W(xx* + ]rPx^+Yx\ -r|aa;*+^j3x*+ya:l 

J-h l-d Z Ud Z, * 

= I oA* + 2yA = ^ (2/1 + ^y% + J/s)- 

The approximate areas under the curve y=/(x) on the bases A^A^Ag, 
AfAfAj, ... are obtained similarly as * 

^{y»+^yt+yt), |(y*+4j/*+y7), .... 

By adllition, we have the approximate area under the curve on the 



INTEGRALS OP FUNCTIONS OF ONE VARIABLE 399 

total base, i.e. the approximate value of the integral required. 

Hence, 

f* h 

J /(a:) da: = - {(y^ + + 2 (t/j + ^5 + • • • + 2/2n-i) + 4 (yg + 2/4 + . . . + y^n)) 

approximately, Simpson's rule for approximate integration. 

Two examples illustrate the rules obtained : 


Ex. 1. To find an approximate value of log 2 = 
range (0, 1) into four equal parts by the points 

a:i = 0, a :2 = J, a;j = i, = l and a:6 = l. 

The ordinates of y^ j ^ — at the points of division are 

yi = l. ^1! = #. y» = h and = 

The trapezoidal rule (h = gives the approximate value 
ri dr 

log 2 = tt:: = i{(l + i) +2(1 + § +f)} =0-6970. 

Jot +a: 

Simpson’s rule gives the approximate value 
ri dx 

log 2 = r y^=*{(l + i) + 2(1) +4(1 +-t)} =0-6933. 

Jo A -tx 

Five-figure tables give the correct value of log 2 as 0-69315. The 
nature of the approximation of the two rules is thus seen. The rules give, 
closer approximations, of course, if the range (0, 1) is divided into more 
than four parts. 

Ex. 2. The area imder the curve y=e“**, above Ox and between the 
ordinates a; =0 and a; = 2 is given by the integral 

2 

e-^dx, 

0 


f :r— I we divide the 
iol+x' 


Dividing the range (0, 2) into ten equal parts the ordinates at 

the elqven points of division are found, from tables, as 

= 1-0000, yg =0-9608, yg =0-8521, ^4 =0-6977, 
yg =0-5272, yg =0-3679, y^ =0-2369, yg =0-1409, 
yg = 0-0773, y^o =0-0392 and y^ =0-0183. 

The trapezoidal rule gives the approximate value 

dz = i^{l-0183 + 2 (3-9000)} = 0-88183, 

and Simpson’s rule gives the approximate value 



da: =iij{l-0183 +2(1-6935) +4(2-2065)} =0-88209. 


The required area, to three decimal places, is thus 0-882 


0 
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15.6 The relation between average and marginal concepts. 

The demand of a market for a good X can be represented by three 
functions and curves showing total, average and marginal revenue 
respectively for various outputs of X. The relations between these 
functions and curves can now be examined in more detail. In Fig. 
96, P and Q are points on the average and marginal revenue curves 



corresponding to a given output x=OM. Denote average revenue 
(price) by p=MP and marginal revenue by p„=:MQ. Then the 
, total revenue from the output x is 

B—px=0M . MP, 

i.e. the area of the rectangle OMPN. Further, 

dR 

dx 


I.e. 


■r 

Jo 


JS = constant. 


The arbitrary lower limit of the integral is here fixed at zero output. 
Put x = 0 ; then the constant must be zero since R is zero at zero 
output. Hence, 


Jo 


dx^ 


Le. the area under the marginal revenue curve, above Ox and between 
the ordinates at 0 and M. As long as the marginal revenue curve is 
above Ox, this area is positive and increases as x increases. When 
the curve falls below Ox, part of the area must be taken as negative 
and \he total area decreases as z increases. 
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The total revenue from any output can be read off as an area 
either from the average or from the marginal revenue curve. We 
have a relation between the two curves at any output. No matter 
how the position of M varies, the area of the rectangle OMPN is 
always equal to the area under the marginal revenue curve on the 
base OM. 

A similar result holds for any pair of average and marginal curves. 
For example, if tt and denote the average and marginal cost of an 
output X oi B, good, TT and tt^ being functions of x, and if the cost of 
zero output is zero,* then total cost is 

n =xtt = I TT^ dx, 

Jo 

Total cost is read off the average cost curve as a rectangular area 
and off the marginal cost curve as an area under the curve. 


15.7 Capital values. 

It has been shown that the present or capital value of an income 
stream (e,g. from land or a macliine) consisting of this year and 
fuj, £a 2 , ... hi ^ following years is 


m fj 

where interest is reckoned yearly at lOOr per cent, (see 9.7 above). 
This calculation can be generalised to allow for a continuous income 
stream and for interest computed continuously. 

It is assumed that income is obtained continuously over time, 
the rate at any time t years from the present being £f(t) i)er year. 
This implies that, in the small interval of time between t and {t-)r At) 
years from the present, the income obtained is approximately 
£f{t)At. Interest is computed continuously at the rate of lOOr per 
cent, jfer year, where r can vary over time as a function of t. The 
present value of the income in the small time-interval (^, t+Jt) is 
£/(/) At approximately. If the income stream “ dries up ” x years 
from the present, the approximate present value of the stream is 

where the summation extends over all small intervals At from < = 0 


* If the cost of zero output is Uq, it is found that the constant in the integral 

! » 

TT^ dx, i.e. the eirea under the marginal 
® • 
cost curve, is the total variable cost of the output x. 


o 


M.A. 
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to t==x. As the number of time-intervals is increased, each interval 
becoming shorter, this approximation becomes closer. The exact 
present value is defined as the limit of the sum as the number of 
intervals increases indefinitely, i.e. as 

2 /=[ f(t)e-^*dL 
J 0 

This is the capital value of the stream estimated at the present time 
(< = 0). Thus, in passing from a discontinuous to a continuous 
income stream, the capital value changes from the sum of a finite 
number of terms to the limiting case of a sum as represented by an 
integral. We have an immediate application of the sum-area aspect 
of an integral. 

The result obtained holds for an income stream and a rate of 
interest varying in any way over time. In the particular case where 
income is derived at a constant rate of £a per year and where the 
rate of interest is fixed at lOOr per cent, per year over time, we have 

t/=:aj* = a ^ t "" r 


,i.e. y=“(l-e-"). 

The capital value thus depends, in a simple way, on the size of the 
income stream, the number of years it flows and the rate of interest. 

Problems of capital accumulation can be treated in a similar way. 
If £ao, foi, fua, ... £am invested in the present and in succeeding 
years at lOOr per cent, compounded yearly, the total sum accumu- 
lated at the end of the period is % where 

m 

r-o 

For a continuous investment which, t years from now, is made at the 
rate of £f{t) per year, the total sum accumulated at a time x years 
from now is found, by an argument similar to that 'above, to be 

Jo 

whefe interest is added continuously at lOOr per cent, per year. In 
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the particular case where the investment £a and the interest rate 
lOOr per cent, are both fixed over time, we find 


«=o f e’’<*-‘>df=a -a , 

^ Jo *- r L r J^_o' 


i.e. 




This expression is simply e*"* times the previous expression giving 
the present value (at t — 0) of the same constant income stream, 
i.e. the accumulated value of the constant income stream after 
X years is the present value itself accumulated at the fixed rate of 
interest. This result does not hold when the investment or the 
interest rate varies over time. 

As an application of the results above, suppose that the production 
process of 14.5 above is carried on continuously over time. A 
constant expenditure at the rate of (ap^ + ^Pb) ^^il^ of time is 
made contmuously and the product x of each investment is realised 
after the production period of t years. The total accumulated value 
of all investments in a period of t years is 

r 

where interest is reckoned at the fixed continuous rate of lOOr per 
cent, per year. The sum k can be interpreted either as the accumu- 
lated amount of loans needed to start the production process (be- 
tween the original investment and the appearance of the first product) 
or as the total value of aU “ intermediate ’’ product existing at any 
time. In any case, k represents the amount of “ working ’’ capital 
needed to keep the production process running continuously. In 
the competitive case, where the market rate of interest is such that 

x=^(apa-hbp^)e^*, 


we have 


k = 


^-((^Pa + bp^) 


If x=Aa^b^-^t^ is the production function, then (from 14.5) 

• • 

r 

and the capital needed varies inversely with the competitive level of 
the market rate of interest. * 
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15.8 A problem of durable capital goods. 

The general problem of durable capital goods is complex but can 
be simplified, at a first approach, by making the following assump- 
tions.* A capital good (e.g. a machine) is produced at a cost of £f(t) 
dependent, in a known way, on the length t years of its active and 
continuous life. An entrepreneur, bu 5 dng the good at its competitive 
(cost) price, obtains from its use a constant and continuous income 
stream of £a per year. At the time of purchase, the discounted 
(capital) value of all future product obtained from the good during 
its life of t years is 

aj c-^‘d<=“(l -e-’’*), 


where lOOr per cent, per year is the constant and continuous market 
rate of interest. It is assumed that the entrepreneur chooses the 
length of life of the good to maximise the capital value as a propor- 
tion of the cost : 




a(l -c-*"*) 

rm 


Here 


dt 




»•{/(<)}* I 



The necessary condition for the optimum length of life is thus 


e'’‘ = l+r 


m 

/'(O’ 


a relation giving t in terms of the given rate of interest. 

In the normal case, the cost/(<) increases, but less than proportion- 
ately, with the durability (t years) of the good. As an actual example 
of a cost fimction of this type, take /(/) =bjt, where 6 is a constant. 

Here and the equation giving the optimum t in terms 

off is 

e* = l+2rt. 


* The analysis is based on the work of Wicksell following Gustaf Akerman. 
See Wicksell, Lectures on Political Economy, Vol. I (English Ed. 1934), pp. 274 
et seg. 
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Write x=^rty and we have 

e* = 1 + 2x. 


Solving by graphical methods (2.8 above), it is found that x has one 
value (other than zero) which is approximately x = 1*27. So 


< = 


1-27 


r 


approximately. 


The length of life of the capital good, as selected by the entre- 
preneur, varies inversely with the rate of interest. If the market 
rate of interest is 3 per cent., the optimum length of life is over 
42 years, but at 4 per cent, the length of life is not quite 32 years. 

Suppose now that production is carried on continuously, the 
capital goods being brought into use continuously and uniformly at 
the rate of n goods per year. After an initial period, it follows that 
the goods are scrapped at the same rate, each having served its 
optimum life of t years. At any moment, there are nt goods in use 
with ages varying uniformly from 0 to t years. The number of 
goods in use with between x and x-\-Ax years of life to run is approxi- 
mately nAx. The value of the future product of one such good, 
discoimted to the moment considered, is 


r 

The total present value of all goods in use is approximately 

S^(l - c-*'®) 71 zJx, 


the Buip extending over all intervals Ax from 0 to <. Taking the 
limit for the continuous range of ages of the goods, the exact present 
value of all capital goods in use at the given moment is 


* •/ V 

r \ r rJ 


This capital value, like the optimum life of each good, is dependent 
on the given market rate of interest. 
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15.9 Average and dispersion of a frequency distribution. 

We are given N items showing values of a certain attribute 
according to the frequency distribution : 


Value of attribute 

- 

.. 

.. x^ 

Total 

Number of items - 


Vi y% 2/8 •• 

• yn 

II 





r-l 


For example, the families in a certain town may be distributed 
according to the number of rooms occupied. The attribute, number 
of rooms occupied, takes the values x^—l, X 2 = 2, 0:3 = 3 , ... . Then 
f/i represents the number of families occupying one room and so on. 
The average value of the attribute is defined as 

=5,5.’''*' 

and the standard deviation of the attribute as (?* where 

These are familiar statistical figures, the average indicating a 
central value and the standard deviation indicating the “ spread 
or “ dispersion ” of the attribute in the group concerned. 

Most of the variable characteristics considered statistically are 
not of this discontinuous nature. They are so finely divisible as to 
be measured by values which can be taken as continuous variables. 
The ages of a set of school children, the rents paid by a group of 
families and the death-rates in a number of districts are diverse 
examples of continuously variable characters. In practice, a 
number of items distributed according to such a characteristic is 
shown in definite groups or grades of the variables. Families, for 
example, may be arranged in grades of rent between 2 and 3 shillings, 
between 3 and 4 shillings, and so on. This is, however, a rough 
method and the results obtained are affected by the nature of the 
grading. Something more precise is required for a theoretical treat- 
ment. It is assumed, therefore, that the frequency distribution of a 
number of items is represented by a continuous function y—f{x).. 
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implying that the number of items with value of the characteristic 
between x and (x-hAx) is approximately /(x)Ja;, the approximation 
being closer as the value of is made smaller. The total number 
of items is N^i:f(x)Ax 

approximately. Taking the limit, we have exactly 


^ = f f{x)dx, 

a 


where a and b are the lowest and highest values of x found. 

The approximate values of the average and standard deviation 
of the variable in the distribution are x and a* where 


x=^ — Y,xf{x)Ax and 


- xYf{x)Ax. 


Taking the hmit for an exact representation, we have 
1 « f xf(x)dx 

f{x)dx 

^ a 

1 p f (x-x)’^f(x)dx 

and = 

f{x)dx 

J a 


We have here another direct application of the sum definition of an 
integral. 

For example, Pareto’s income law asserts that 

. y = — (A and a positive constants) 

represents the number of persons with an income of £xy a continuous 
frequency distribution of persons according to income. The number 
of persons wdth income between £a and £b is 


f* r 

■_^n 

-1 


- OC ft 

- a x^X. 


The average income of these persons is 


-if 

ivJ. 


X -4-: dx = 


Ni.x^ 


1 A f I _ 2_Y • 
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Take a=x and let h tend to infinity. The number of persons with 
income over Sx is thus 

N=- - 

a 

and the average income over £a; is 

- 1 A 1 _ a 

^~N a-1 

The law has the property that the average income over £a; is a 

constant multiple ^ ~ of far. In practice it is found that a = 1*5 

approximately. Hence, the average income over far is approxi- 
mately f 3ar, a result which agrees well with observed data of income 
distribution, at least for large incomes. 


EXAMPLES XV 
Integration 

1, From the sum definition of an integral, show that f ldx = b~a. What 
is the indefinite integral of 1 ? Hence check the result. 

n Xa? “f" 3/ \ 

. , 2* If /(a?) is continuous, the smn 2 / J - ar,.) has j^/(a;)ciar as 

its limiting value as n increases indefinitely. Deduce that 1 a;da; = J(6* - a*) 

• and check from the indefinite integral of x, ® 

3. If /(ar) = «'*, check that f /(ar)dx=( J(x)dx-{-\ f(x)dx for any values 
of a, b and c. 

4. Show that | J{x)dx^ = -f(x), (Use property (1) of 16.1.) 

• 6. Evaluate the indefinite integrals of 

. 1 ^ ^ t ® aa: + 6 

a?», — , l-a:* + a;S aa? + 6 + -, — — . 

i^X X ttjx 

6. Use the results (l')-(3') of 16.4 to evaluate the integrals of 

n 1 1 2ar-fl 1 ./• 

ir^' (3* + 2)«’ + * ' *« > ' 

7 . Show that |y-^^<ie=log(l + .•) and ^^^^^dx = log(e’‘-e~*). 

8« From the integral of the general expression (aa5 + 6)", show that 
|(1 4-ar)*cia; = ^(l -f-ar)*. Expand (1+a;)* m powers of x and show that 

j ( 1 4- i) * da: = a? f X* + ~ a:*. Why are the two results apparently different ? 
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d/u 

9* If ^ = log a?, check that y =a;(log as - 1) + constant. 

f -.1 (ic-6)* 

10. Show that J (a; ~ 6) e 2 a* clx= -a*e ^ ®* . 

11, Verify that -J— = ~ ( — L and deduce that 

as* •“ Jl « xa? “ J, a? "f" 1/ 

f da; 1 a;- 1 

]a:*-l"2 

1 a* 
a® da; = . 

13. Find |^2*da;. Check by drawing an accurate graph of y = 2® and esti- 
mating (by counting squares of the graph paper) the area under the curve, 
above Ox and between the ordinates at a; = 0 and a; = 2. 

14. Find, by integration, the area under the straight line f/ = a;+ 1, above 
Ox and between the ordinates a;=0 and a;=l. Represent graphically and 
check by calculating the area of a trapezium. 

1 14 r«-l dx 

16. Show that j^a;»da; = j, J^Va;da; = — , 

16. Find the area under the parabola y^x^, above Ox and between the 
ordinates at a;= - 3 and a? = 3. Verify that it is twice the area between the 
ordinates at os = 0 and a; = 3. 


17. Show that the area between the rectangular hyperbola xy^a^ and the 
axis Ox and between ordinates at a; = a and a; = 6 is a* log - , if o and b have the’ 

Of 

same sign. Why does the result not hold when a and h have opposite signs 7 

f2 

18. Evaluate j^{a;* - 4a; + 3)da; €uid check that it equals the difference 

between the area between the parabola y=a;*-4a;+3 and Oa; from aj=0 to 
a;= 1 and the area between Ox and the parabola from 3?= 1 to a?= 2. Illustrate 
graphically. 

19. By tcddng the difference between two integrals, show that the area 
enclosed between the peurabolas y = x* and >/a; is i. 


• fa f« 1 f« 

20. Show that \ a;e“**da; = 0 and I a;*e“** cZa; = - 6“** da; for any value 

J -41 J— a iSJ-a 

of a (see 16.4, Ex. 9, above). 

fl dx 

21. Calculate approximate values of by the trapezoidal and 

Simpson's rules, dividing the range (0, 1) into ten equal parts in each case. 
The value of this integral is known to be \tt. Hence estimate the closeness of 
the two approximations. 

22. If the velocity of a body at time I is v=f(t), show that x — ^^f{t)dt 
is the distance travelled from zero time to time t. Find the distance when 
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23. The acceleration of a moving body is a constant a over time. Shovv 
that, after time the velocity iav = u-\-at and the distance travelled a: = 
where u is the velocity at zero time. 


Integrals in economic problems 

If the marginal revenue of output x is = a - bx, find the total revenue 
function by integration and deduce the demand function. 

ab , 

If the marginal revenue function is p^ — - c, show that 

p = - c is the demand law. 

^ x + b 


26. Find the total cost function if it is known that the cost of zero output 
is c and that marginal cost of output a; is 7r^ = oa; + 6. 

( 27 ) If n^ = — — — is the marginal cost function and if the cost of zero 
^ " s/ax + b 

output is zero, find total cost as a function of x. 

28. If interest is reckoned continuously at 4 per cent, per year, show that 
the capital value of uniform income stream of £100 per year for 10 years is 
approximately £824 and for 100 years approximately £2,454. (e = 2-71828.) 

29. Write down the capital value of a uniform income stream of £a per year 
for X years, reckoning interest continuously at lOOr per cent, per year. By 
letting a;->oo , deduce that the capital value of such an income stream going 

on for ever is £ - . 

r 

30. An income stream decreases continuously over time for x years, the rate 
b*eing £a per year at t years from now. Find the capital value at lOOr per 
cent, continuous compound interest. Show that this equals the capital value 

^ of a uniform income stream of £a per year for x years if the rate of interest is 
raised to 100 (r + 6) per cent, per year. 

31. An income stream decreases continuously for x years. At time t years 
from now, the income obtained is at the rate of £a(l - 2a0 per year and the 
continuous rate of interest is 100r(l - a^) per cent, per year. Show that the 
capital value of the income stream equals that of a steady and continuous 
ii^come of £a per year for x years at the constant rate of interest, computed 
continuously, of 100r(l - <xx) per cent, per year. 


32. In the problem of Examples XIV, 31, show that the value of all inter- 
mediate products at any time is 



where - = 16 log A. In the competitive case (r being such that ap^ + bp^ = xe"'^*) 


show that 


k=^lQx 



'=) 




log(2VpaP(,).‘ 


•) 


a 

33. In the problem of 15.8, p is defined so that /(^) = -(l - i.e. the 

P 

product of the capital good represents a net rate of return of lOOp per cent, 
(comjftited continuously) on its cost f{t). Find an equation for the length of 
life of the capital good so that p is a maximum. 
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34. In the competitive case of the problem of 15.8, the market rate of 
interest is such that the present value of the product of the capital good 
equals its cost. Show that the optimum length of life of the good as deter- 
mined in 16.8 equals that determined in the problem of the previous example. 

36. If f{t) = 3a^y^ and the competitive case of the previous example holds, 
show that the market rate of interest is approximately 4t^ per cent, per year 
and that the capital good has a life of approximately 28 years. 

36. In the problem of 16.8, if f(t) = bsjt and the market rate of interest is 
lOOr per cent, per year, show that the present value of all capital goods in use 

at any moment is approximately fc = 0*55 ^ . If the rate of interest is 6 per 

cent, per year, show that the present value is approximately 220 times the 
value of the yearly product of the capital goods. 

37. In the problem of 15.8, the cost of a capital good of life t years is £/(0 

where f{t) = 6^1- • Show that the optimum value of < is given by 

When the interest rate is 4 per cent, per year, Ifind by graphical methods the 
approximate value of t. 

38. In a sample of 898 working-class families in Bolton, 1924, the families 
were classified according to the number of rooms occupied ; 


No. of rooms 

2 

3 

4 

6 

6 

7 

8 

Total 

No. of families 

16 

477 

227 

1G9 

7 

2 

1 

898 


Show that 3*65 is the average munber of rooms occupied per family. Find 
the standard deviation of the nimiber of rooms occupied per family in this 
group of families. 

39. By differentiating both sides of the equation, show that 

I x^e~^ dx~r^ dx - 

(-00 ^ [b 

for any value of r. If denotes a;^e“®da: = Lim deduce that 

= and that /o= 1. Hence show that the average and standard devia- 
tion* of the frequency distribution y = xe~^ for the range from a; = 0 to infinity 
are x = 2 and = %y2. 
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DIFFERENTIAL EQUATIONS 


16.1 The nature of the problem. 

Data provided by scientific observation usually relate to small 
movements or changes in the quantities concerned. Information in 
“ bulk ”, covering a wide range of events, is scarcely to be looked 
for in practice. Translating the data into forms suitable for mathe- 
matical expression, we obtain relations between increments in 
variable quantities and, in the limit, the relations involve the 
derivatives or differentials of the variables. Mathematical theories 
in the sciences, if they are to be “ realistic ”, must thus be built on 
the b^^is of differential eqvxitiona, relations between the derivatives 
of differentials of varying quantities. It is then the business of the 
theories to deduce the functional equations between the variables 
which Ue behind the differential equations, i.e. to express the 
** general laws ” whose variations correspond to the given data. In 
physics, for example, much of the data is expressed in terms of 
variations in “ energy ”, of one form or another, and leads to 
certain fundamental differential equations from which the laws of 
‘‘ conservation of energy ” are to be derived. Differential equations 
thus occupy a fundamental position in most highly developed mathe- 
matical sciences. 

The nature of a differential equation can be examined first in the 
case where only first-order variations in two variables x and y are 
involved. The simplest case can be written 


dx 




where ^{x) is some given function. We have seen (16.3 above) that 
this case is easily dealt with by means of the concept of an integral. 
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A more general form of the case is 
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where the given function /(x, y) involves both x and y. Given the 
values of x and y, then the value of the derivative of the relation 
that must exist between x and y is also known. From this in- 
formation, we attempt to deduce the actual form of the functional 
relation, to find that particular function which gives a derivative 
.satisfying the above relation. This is the problem of the solution, 
or “ integration ”, of a differential equation, a problem which is 
clearly an extension of that already solved by simple integration. 
A still more general form of the type of differential equation here 
considered is a relation involving the derivative in a non-linear way, 

\2 

a relation including, for example, such terms as 


(dy\^ , ldy\ 

I ) or log 1 1 • 

\dx/ ° \dxl 


The development of the following sections is concerned almost 
entirely with the simpler linear differential equation, but it must be 
remembered that more complicated forms can be considered and 
solved. 


du 

The meaning of a differential equation of the form y)*is 


most clearly seen in diagrammatic terms. At each point (x, y) oi 
the plane Oxy^ the equation provides a definite value of the derivative 


dx 


This is the gradient of the direction in which the variables are 


allowed to vary from {x, y), i.e. the gradient of the tangent to the 
(at present imknown) curve through (x, y) expressing the relation 
between x and y implied by the differential equation. Our problem 
is thus that of constructing complete curves from knowledge only of 
their tangent gradients. An example illustrates. The equation 


dy ^ a;* - y* - 1 

2xy 

gives a tangeiat gradient at each point of the plane Oxy. At the 

d'u 

point (1, 1), for example, the tangent gradient is ^ = J, a line sloping 

upwards with gradient | through the point (1,1). A convenient set 
of points, covering the plane Oxy fairly uniformly, is selected and 
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the tangent gradient evaluated at each point. When lines with the 
appropriate gradients are drawn through the points, a graph of the 
kind shown in Fig. 97 is obtained. It now appears that a system 
of curves, one through each point of the plane, can be drawn to 
touch the lines of the graph. This curve system, some members of 



which are shown in Fig. 97, corresponds to the solution of the given 
differential equation. In this particular case, the curve system 
consists of circles with equation - 2ax-^\^0 where a is a 

parameter. This equation is the “ integral of the differential 
equation.* 

From a slightly different point of view, we can start from ‘any 
point of the plane Oxy and describe a curve through it by moving 
always in the direction indicated by the differential equation. 
Having obtained this curve, we can select a point not on it and 
describe a second curve passing through this point. The process 
can be continued until a whole system of curves ,is obtained to 
represent the solution of the given differential equation. 

♦ The use of the term “ integral ” to describe the solution of a differential 
equation is clear. The solution corresponds to a curve system which is built 
up, or ‘•integrated ”, step by step, from the tangent gradients given by the 
differential equation. 
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An important point now arises. The integral of a differential 
equation of the type considered is shown by a curve system and 
so it must involve a single parameter or arbitrary constant. This 
is illustrated by the integral, 2ax -f 1 = 0, of the equation 

above. The point is worth emphasising by putting the converse 
problem. The following examples show that a single arbitrary 
constant can be eliminated from a functional relation by differ- 
entiation : 

Ex. 1. From the relation -e^ =a, 
we obtain 2x dx -e^ dy = 0, 

dy 

i.e. the differential equation 


Ex. 2. From 
we obtain 

i.e. 


x^-\-y^ - 2ax + 1 =0, 
2x dx ■\‘2y dy - 2a dx == 0, 
(xdx+y dy) 


a — 


dx 


I.e. 

i.e. 


Substituting in the original equation, we find 

, „ ^ xdx+ydy , * 

■\-y^ -2x — - + 1 =0, 

(x^ -y^ - l)dx + 2xy dy = 0, 
dy x^ -y’^ -\ 

— - =3 — ■ 

dx 2xy 


a differential equation in which the parameter a does not appear. 

Ex. 3. Differentiating y^ - 2ax + = 0, 

2y dy -2adx = 0. 


we have 


dy 


Substituting a=y-,- in the original equation we find a differential 
(lx y 

(ly 

equation which is not linear in the derivative ^ : 

dx 

The integration of a differential equation reverses the procedure 
of these examples, taking us from a relation in the derivative to the 

dy 

complete relation •between x and y. So the integral of ^ = 2xe^* is 

a;*~e*'=a, and similarly for the others. In each case the integral 
involves one arbitrary constant : 

The integral of a differential equation in the derivative of y with 
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respect to x is a relation between x and y which includes a single 
arbitrary constant. 


The result stated, though not formally proved, has been illustrated 
suflSciently for our purpose.* 

To extend to cases of more than two variables, suppose that we 
have a relation between the first-order variations in three variables 
X, y and z. In its simple form the relation can be written 
f{x, y, z)dx-}-g{x, y, z)dy-dz = 0, 
giving the values of the partial derivatives 

^=f{x,y,z) and ^=g{x,y,z). 

In diagrammatic terms, we are given the tangent plane (by its 
gradients in the two fundamental directions) at each point of Oxyz 
space and require to ‘‘ integrate the planes into a system of sur- 
faces. The integral of the differential equation thus appears as a 
relation between x, y and 2, including one arbitrary constant, and is 
shown by a system of surfaces. An example makes this clear : 


Ex. The relation 
gives 

^ Substituting a = 2 


^-y^^az (a being a parameter) 
2x dx + ^y dy=a dz. 


xdx+y dy , 


dz 


in the original relation, we find 




xdx'Vydy 


i.e. 2a:2 dx + 2yz dy - {x^ + y*) dz = 0, 

dz 2xz - dz 2yz 

or ^ = -0 9 and 3-= ^ ■ 

dx x^+y* dy x^-vy^ 

We have a differential equation in which the parameter a doeS^ not 
appear. Conversely, the integral of the differential equation shown is the 
relation +y* =az including the arbitrary constant a. 


More involved differential equations in three or more variables 
can be written and some examples will appear later. The simple 
type here considered suffices for the moment. •' 


• It is to be noticed that the indefinite integral which is the solution of 
dy 

contains an additive constant, a particular case of the present 
general result. 
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16.2 Linear differential equations and their integration. 

Of the first-order differential equations in two variables x and y, 
the most useful is the linear differential equation in which the deriva- 
tive or differentials are related linearly, squares, cubes and other 
expressions in the derivative or differentials being excluded. We 
can write the equation either as 

P dx i-Q dy = Oy 

where P and Q are given functions of x and y, or as 

t=/(*. y), 

where /(r, y) is some function of x and y. To pass from one form to 
the other, it is only necessary to associate / [x, y) with the ratio 

y It is to be noticed that, though the derivative appears 

linearly, there is no such restriction on the variables x and y. 

As in the case of simple integration discussed in the previous 
chapter, there is no comprehensive practical scheme for integrating 
differential equations, even of the hnear type. Some particular 
methods for certain cases are given below but there remain many 
instances in which the integral can be obtained only after much 
trial and error or which cannot be integrated by any practical 
method. • 

The following are the four main cases for which definite practical 
methods of integration can be given : 

(1) The form ^ =/(»:)• 

By simple integration, the solution of this form is 

y=^f{x)dx->ra. 



where a is an arbitrary constant. 

(2) The form ^ !=/(*). 

This form ca^ij &lao be solved by simple integration. 


d^ 

dx 


(log y) = 


1 ^ 

y dx’ 



Since 
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Hence, 


logy^^f{x)dx 


+o, 


where a is an arbitrary constant. The variable y is found explicitly 
in terms of x by taking the exponential of the right-hand side. 


Ex. 1. 
Here 

and 


y{\ -z)dx -X dy =0. 

1 dy \ -X \ j 
y dx X X 

log y f(^ -li^dx + constant = log a; - x + 6. 



ite. y = (a:e~*). 

Writing a where a is thus an arbitrary constant, we have 

y^axer^. 

Ex. 2. To find y as a function of x when it is given that the elasticity 
of y with respect to a; is a constant a. We have 



X dy . \ dy a 

y dx y dx X 

So 

log V = a f “ + constant = a log a; + constant, 


i X 

i.e. 

• 

log == constant. 

i.e. 

V 

— = constant = a. 

a?* ’ 

i.e. 

y-=axf‘. 


Ex. 3. As X increases, the variable y increases at a rate always equal 

d/u 

to a multiple p of its current value : "^"o find y as a function of x, 

we write : 


So 


1 

y dx ^ 
log y=px + b, 


and y = = ae^*, 

where a is an arbitrary constant. Compound interest ^ewth at the con- 
tinuous rate of lOOp per cent, per year is a particular case of this result. 
(3) The form <f>{x)dx'\‘^(y)dy = 0, 

This^ case, in which the variables are said to be ‘‘ separate ”, 
holds whenever the differential equation can be so arranged that the 
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coeflScient of dx depends on x only and the coefficient of dy on y 
only. The integration is then effected at once : 

j'(^ (a:) da: + {y) dy ~ constant. 

It is only necessary, therefore, to evaluate two simple integrals. 

Ex. 1. e''^ = 2a;. 

dx 

Here 2x dx -e^ dy^ 0, i.e. 2 Ja: da: - dy = constant. 

So a;2-e*'=a, i.e. e*' = a;^ - a, 

or y^\og{x^-a), 

where a is an arbitrary constant. 

Ex. 2. {y ■^k)dx -h (x-\-h)dy =0, where h and h are fixed constants. 
dx 


Here 


a;-f- 


: dy ^ . C dx C dy 

-? H j =0, i.e. I 1 + — ~ = constant. 

h y-hk Jx + h Jy-tk 


So log(x-hh) +log(y-hk)=b, i.e. log (x -h h) (y + k) = b. 

Hence, writing a = e^, we have 

(x+h)(y-i-k)^a, 

where a is an arbitrary constant. The integral is represented by a system 
of rectangular hyperbolas (see 3.7 above). 

Ex. 3. The variable y increases as x increases at a rate given by ^ 

where L and p are given constants.* To find y as a function of x : 


Here 


L dy 

y{L-y) 


= p dx, 


, i.e.| 


L dy 

y{L‘-y) 


= px+b. 


* I ^J) = |(J + L^) dy-logy-log{L-y)=- log 


L-y . 


So 


log ~ — ^ = -px-b, i.e. — — ~ = ae~^ 


y y 

where a = is an arbitrary constant. After rearrangement, 

» • L 

^*I +ae“^ 


• This is a modified version of growth at compound interest. The curve 
representing y as a function of x is called a logistic curve and it is sometimes 
used to express the growth of a population over time. 
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(4) The “ exact form. 

It may happen that the diflFerential equation can be arranged, 
after multiplying through by a suitable factor, in the form 

P dx + Qdy — (), 

where the left-hand side is the differential of some function y), 
i.e. P dx-\-Qdy=^d(f> (x, y). The equation is then said to be “ exact 
and its integral is clearly 

(f) (a:, y) = constant. 

The following examples illustrate how equations can be arranged 
so that they appear in the exact form. 


Ex. 1. ydx-\-xdy=0. 

The equation is exact as it stands and can be written d (xy) = 0. 
The integral is xy - a, where a is an arbitrary constant. 

Ex. 2. ipc^ -y^ - \)dx 2xy dy-0. 


No, 


2(x dx-¥y dy)x - {x^ -{-y^ + \)dx 
2x^ 

(x^ -y^ - l)dx + 2xy dy 
2^2 

1 


Hence, on multiplying through by — the given equation becomes 

^ X^ "t" t/^ ”t" 1 

and the integral is ^ = a, where a is an arbitrary constant. 

So -2aa:4-l =0, 

as represented by the system of circles of Fig. 97. 

Ex. 3. ^ = -2jy ^k ^ where h and k are fixed constants. 

CLX X tv 

We can write the equation in the form 

The integral is thus 

h-s/y + k . 

— ^ — =a, where a is an arbitrary constant. 

Tljis integral is shown by a system of curves, each being the arc of a 
parabola (see 3.7 above). 
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The difiFerential equation of the linear type in three variables 
X, y and z appears in the form 

P dx-^Qdy-\-Rdz — (), 

where P, Q and R are three given functions of a;, y and 2 . This is 
often termed the total differential equation. The integral is a relation 
between the variables including a single arbitrary constant and 
reproducing, on differentiation, the given differential equation. 
Since we can take 2 as a function of x and 1 /, we have 


dz 

dx 


= f-) 

\dx) 


y constant 


p 

R 


and 


dz 

dy 


~ \dy 


dy) 


X constant 


Q 

R' 


An alternative expression of the differential equation is thus 


dz 

dx 


and ^^=g{x,y,z), 


where the functions / and g are to be associated with ( - ^ ) and 

/ m . ^ 

( ~ ^ 1 respectively. The cases where there are more than three 

variables are exactly similar. Practical methods of the kinds indi- 
cated above serve to integrate particular forms of the total differ- 
ential equation.* The following examples illustrate the procedure : 

• * 

Ex. 1 . The partial elasticities of 2 as a function of x and y are constants 
a and p respectively. To find z in terms of x and y : 



X dz T y 9-2 - 

- =a and - =p, 

zdx zdy 

i.®. 

dx ^dy dz ^ 
a — + 

X ^ y z 

So 

aj — (6 an arbitrary constant) 
a log a; + log y - log 2 = 6 , i.e. log 

• 

and 

Writing 

the integral of the differential equation io 


2 == axf^y^. 

Ex. 2. 

xdx ^ydy -\-zdz=0. 

So 

^xdx +^ydy -hjzdz = constant, 

i.e. 

\x^ + \y^ + \z^ = constant. 


* For a method which serves in most ordinary cases, see Piaggio, Differential 
Equations (1920). p. 139. 
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The integral required is thus represented by the system of spheres 

where a is an arbitrary constant. 


Ex. 3. 2xzdx + 2yzdy - (x^ + y^) dz = 0. 

Multiplying through by ^ , the equation is thrown into “ exact ** form 

d(^).o. 



and the integral of the equation is x^-{-y^=^aZy where a is an arbitrary 
constant. This integral is represented by a system of surfaces known aa 
“ paraboloids ** (see 11.2-3 above). 


16.3 The general integral of a linear differential equation. 

Two points arise in considering linear differential equations from 
a general and theoretical, as opposed to a practical, angle. One 
concerns the question whether an integral of a given equation exists 
at all, and the other whether the integral, if it exists, is uniquely 
determined or not. 

The fundamental ‘‘ existence ” results, which answer the first 
‘question, can be stated, but not formally proved, here.* The linear 
differential equation in two variables, 

Pdx + Qdy=0 or y). 

can be shown to possess an integral in aU cases where certain con- 
ditions are satisfied by the functions P and Q, or by the function /. 
These conditions are satisfied in aU ordinary cases, e.g. when / is a 
function with continuous partial derivatives. In the two-variable 
case, therefore, we can assume that aU the usual linear differential 
equations are integrable. 

A complication arises when we pass to cases of three or more 
variables. It can be shown that the linear differential equation 
Pdx-hQdy + Rdz=^0 fails to possess an integral even in quite 

* See de la Vall6e Poussin, Cours d^ analyse injinitisimale, Vol. II (5th Ed. 
1925)*, pp. 133 et seq.y pp. 292 seq , ; Piaggio, Differential Equations (1920), 
pp. 139 et seq. 
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ordinary cases. The integral exists only if the functions P, Q and R, 
having continuous partial derivatives, satisfy a certain relation 



which is called the integrability condition. We cannot assume that 
the equation hS/S an integral at all without first imposing or verifying 
the integrability condition. For example, the equation 

2zz dx -f 2yz dy - (a;^ + y^) rfz = 0 

is found to satisfy the integrability condition ; its integral has, in 
fact, been found above. On the other hand, the equation 

dx-\-zdy^-2ydz — 0, 

though quite simple in form, fails to satisfy the integrability con- 
dition and possesses no integral. 

In diagrammatic terms, these important results imply that a set 

of tangent lines in the plane Oxy with gradients given by ^ =/(3J, y), 

where / is a given function with continuous partial derivatives, can 
always be built up, or “ integrated into a system of curves. But 
this is not true of the corresponding case of a set of tangent plants 
in space Oxyz, It may not be possible, even in quite simple cases, 
to build up the planes into a system of surfaces, i.e. to find^a 
set of surfaces to fit the given tangent planes. It is not easy to 
see why the extra dimension spoils the simphcity of the previous 
result. 


Turning to the question of the uniqueness of integration, the 
integral of a finear differential equation in two variables is an 
equation in the variables including one arbitrary constant. This 
constant must be involved linearly. If, for example, squares or 
cubes of the constant appear, then the process of eliminating it must 


lead to an equation involving squares or cubes of the derivative ^ , 


i.e. to a non-hnear differential equation. Example 3 of 16,1 illus- 
trates this poiflt.* It follows that the constant can be separated off 
linearly from the variables of the integral,* i.e. 


* In fact, the curve system representing the integral of the differential 
equation must be such that one, and only one, curve of the system passes 
through each point of the relevant part of the plane Oxy (see 3.7 above). 
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The integral of the linear differential equation Pdx + Qdy = 0 can 
be written ^ (a:^ y) = constant, 

where ^ is a function with partial derivatives proportional to P and Q. 

Ex. The equation (** -y* -l)<ia;+2a:ydy=0 has an integral 
** +y* -2aa: + 1 =0, 
which can be arranged in the form 

— = constant. 

X 

rr j/ \ a:*+f/* + I 

Here ^(*. !/) = z » 

, dd) x^-y^-l d6 2xy 

which are proportional to the coefficients of dx and dy in the original 
differential equation. 

It is easily seen that the form of the function <f> (a;, y) is by no means 
unique. In fact, if <f){x^ i/) = constant is one form of the integral of 
the given equation and if is any function, then 
F{<f>{x, y)}= constant 

is also a form of the integral. For, if ^(a;, y) has a constant value 
for certain values of x and j/, then F{^(a:, y)} also has a constant 
value for these x and y. Hence : 

If ^{x, y) = constant is one integral of P dx-\-Q dy=^Q^ then the 
general integral is (a;, y)}= constant, where F is an arbitrary 
function. 


This lack of uniqueness only affects the form of the analytical 
expression of the integral ; it does not correspond to any lack of 
uniqueness in the integral itself. The curve system which represents 
the integral is quite unique, being obtained equally well from 
F{<f){x^ y)}= constant as from ^(a:, y) = constant. The contour map 
of the function z—F{<f>{x, y)} does not depend on which form of F 
is selected and always gives the integral of our differential equation. 
For example, the integral of {x^-y^- l)da; + 2a;ydy=r0 is shown by 
the perfectly definite system of circles (Fig. 97). The analytical form 
of the integral can, however, appear in various disguises, e.g. 



X 
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Exactly similar results hold for the total diJfferential equation in 
three variables P dx-\-Q dy-\-R provided that the integrability 
condition is satisfied. The form of the integral appears 

Vy 2 )} = constant, 

where ^ [x, t/, z) = constant is one integral and F is an arbitrary 
function. The partial derivatives of <f> are proportional to P, Q and S, 
But the integral itself, as represented by a system of surfaces in 
space, is quite definite. The result extends, of course, to cases of 
more than three variables. 

16.4 Simultaneous linear differential equations. 

Two linear differential equations in three variables must give rise 
to two functional relations between the variables, each including 
one arbitrary constant, and represented by two systems of surfaces 
in space. An actual example will illustrate the position. 

Ex. By differentiating and eliminating the constants in 
x-hyi-z-a and 

we have dx + dy -i-dz^O and 2xz dx + 2yz dy - (x^ + y^) dz = 0. 

The two functional relations provide the solution of this pair of linear 
differential equations. The latter can always be arranged to give the 
ratios of dx, dy and dz in terms of x, y and z. Here, on solving the differ- 
ential equations for the differentials, we find • 

dy x*+y* + 2xz , a? -y 

dx x^-\-y^ + 2yz dx ^ x* + 4 . 2yz 

dx -dy _ dz 

x^-\-y^ + 2yz x^ -i-y^ -\-2xz 2z{x-y)’ 

Further, the relations giving the differential equations can be arranged 
to give any two of the variables (say, y and z) as functions of the third (x). 
In the present case, we find 

• y = ± Jn/( 62 +4a6) -46x -4x* - ^6 
and z = T +4a6) -46x -4x* -x + 0 -f J6. 

These functions involve the two arbitrary constants a and 6. We can 
now say that th^ (^erivatives of these two functions must correspond to 

the values of ^ and ^ given from the differential equations. In dia- 

(JLOO CLOC 

grammatic terms, the solutions of our differential equations is sbpwn by 
a pair of systems of surfaces in space. But the intersection of each pair 



426 


MATHEMATICAL ANALYSIS FOR ECONOMISTS 


of surfaces (one from each system) is a curve in space. The solution is thus 
shown by a whole system of curves, not of the plane kind, but of the spatial 
kind. In the present case, the curves are parallel plane sections (by 
x+y + z=a) of paraboloid surfaces (x^ -{-y^ =bz). These curves are in 
space ; they do not lie in one and the same plane. 


In general, two given linear differential equations in three variables 
can be arranged in the form 


dx __dy ^dz 

T'^'Q^R 


where P, Q and R are some functions of a;, y and z. So, if we con 
sider y and z as functions of a;, we can write 


^ =fi^, y, z) and J = gr (z, y, z), 

Q It 

where the functions / and g are equivalent to the ratios ^ and p. 


It can be shown that the integral of these equations exists in all 
cases subject to the usual conditions concerning the continuity of 
the functions P, Q and P, or the functions / and g* The integral 
appears in the form of two relations between the variables, each 
involving an arbitrary constant which can bo separated off linearly : 


<f> J/, 2^) = a and ip {x, y,z)=b. 

As before, the analytical forms of (f> and ip are not unique. In fact, 
the integral can be expressed in general as 


F{<p {x, t/, z)} = constant and 0{ip {x, y, z)} = constant, 

where F and O are any fimctions. Alternatively, the integral can 
be written, on solving the two relations, as giving (say) y and z as 
functions of x. Each function involves two arbitrary constants and 
its analytical form is not unique. 

The integral described can be shown, in diagrammatic* terms, as 
a pair of systems of surfaces in space, i.e. as a system of curves 
in space. This curve system is perfectly definite and not affected 
by the lack of uniqueness of the analytical form ,of the integral. 
The nature of this diagrammatic representation can be examined 
further. 


♦ Seeide la Vall6e Poussin, Cours d" analyse infin iUsimale, Vol. II (6th Ed. 
1926), pp. 14} et seq. 
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Let P be any point on a curve in space, and draw the tangent line 
to the curve at P (Fig. 98). The “ projection ” of the curve and its 
tangent on to the plane Oxy gives a plane curve passing through a 
point Q with a definite tangent, Q being the foot of the perpendicular 
from P to Oxy, The plane curve can be regarded as the “ shadow 
of the spatial curve on the 
plane Oxy (taken as horizon- 
tal). The curve in space is 
represented by two functions, 
y and z as functions of x. 

The shadow plane curve is 
then represented by the first 
of these functions, y as a 
function of x, and the grad- 
ient of its tangent at Q is 
indicated by the derivative 

In the same way, the 

space curve has a ‘‘ shadow ” 
on the plane Oxz, a curve 
representing z as a function of x and with tangent gradient given 
dz 

by ^ . The curve in space and its tangent at P can thus J)e 

investigated by means of two shadow curves and their tangents. 
Now, the given differential equations 

^=/(^. y, 2) and ^=g{x,y,z) 

eiv« values of ^ and ~ at each point P (x, y, z) of space, and so 
dxdx 

determine two “ shadow tangents, one in the plane Oxy and the 
other in Oxz, These “ shadows ” can be combined to give a definite 
tangent line in space at P. The integral of the differential equations 
is then obtained by building up the tangent lines into a system of 
curves in space, one curve passing through each point of space. 

The practical process of integrating a pair of differential equations 
of linear form is often extremely difficult.* The following examples 
illustrate the practical integration. ^ 

* See Piaggio, Differential Equations (1920), pp. 133 seq. 
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dx_dy_^ 


^ ^ , . 1 dy V ^dy 1 

One of these equations can be wntten or 

^ dx X ydx X 

On integrating this by ordinary methods (16.2 above), we have 

y-dx (a being an arbitrary constant). 

dz z 

Similarly, from the other equation , we find 

CtX X 


z^hx (6 being an arbitrary constant). 

The integral of the pair of differential equations is thus 
y^ax and z=6a;, 

shown by a system of straight lines in space, all passing through 0. 

dx dy ^ -dz 
2xz 2yz 


Ex. 2. 


x^-^y^ 

It can be verified that these differential equations can be arranged in 
the form (amongst many others) 


X y 


and a; da; + y dy + 2z dz =0. 

We can now integrate at once : 

= constant and Ja; dx + dy + 2jz dz = constant, 

i .e. log a; - log y = constant and \x^ \y^ + == constant. 

^The integral of the equations can then be written 
y=aa; and x* + + 2z* = 6, 

where a and 6 are arbitrary constants. The two surface systems are now 
a system of planes (y =ax) and a system of surfaces known as “ ellipsoids ** 
(x*+y^ +2z*=6). The integral is shown by a set of plane sections of a 
system of “ ellipsoids 


Ex 3 ^ -dy dz 

x*+y* + 2yz**x*+y* + 2xz 2z(x~y)’ 

It can be verified, in this case, that the two relations 

dx + dy+dz=0 and 2xz dx + 2yz dy - (x* +y*)dz=0 
hold. The integral of the first relation is found at once as 

x+y+z=a ** ♦ 

snd that of the second (16.2 above) as 

x*+y* = 62 , 

where a^,and b are arbitrary constants. The complete integral is thus 
obtained in the form already considered above. 
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16.5 Orthogonal curve and surface systems. 

The linear differential equations in two variables 

+ = 0 and ^ = ^ 

give two tangent lines at each point {x, y) of the plane Oxy, one with 


gradient (to Ox) equal to [ - and the other with gradient equal 
Q . ^ V/ 

to p • Since the product of these gradients is ~ 1, the tangent lines 


are always perpendicular. Hence, the two curve systems repre- 
senting the integrals of the equations must be such that, wherever 



a curve of one system cuts a curve of the other system, the two 
curves are at right angles. Such curve systems are described as 
orthogonal systems. An example of differential equations of the 
orthogonal type is given hy ydx^-xdy = 0 and xdx-ydy=^0. It is 
easily seen that the integrals are respectively xy — a and x^-y^ — b, 
where a Sjpd b are arbitrary constants. The curve system, in each 
case, is a system of rectangular hyperbolas, one system being 
obtained from the other by rotation through 45®. Fig. 99 shows 
certain curves of^the systems in the positive quadrant of Oxy and 
clearly indicates the orthogonal property of the systems. 

The total differential equation in three variables 
P dx -vQ dy Rdz^O 

gives a tangent plane at each point (x, y, z) of space, and its gradients 
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are determined by the ratios of the functions P, Q and R, The pair 
of linear differential equations 

dx ^dy _dz 

T^~Q~R 

gives a tangent line at each point of space and its direction is deter- 
mined by the functions P, Q and R. It is a well-known result that 
this tangent line is perpendicular to the previous tangent plane at 
the same point. This can be verified by taking projections on the 
co-ordinate planes. The tangent plane contains a horizontal line 

(parallel to Oxy) with gradient referred to Ox equal to ^ ^ • 

This projects directly on to the plane Oxy. The tangent line, as we 
have seen, has a projection or ‘‘ shadow ” on the Oxy plane with 


gradient to Ox equal ^ ^ ^ ' 


These two lines in Oxy are perpen- 


dicular. A similar result holds for the projections on to one of the 
other co-ordinate planes, and it is then clear that the tangent line in 
space must be perpendicular to the corresponding tangent plane. 

The total differential equation (if it is integrable) is shown by a 
system of surfaces in space and the pair of linear differential equa- 
tions by a system of curves in space. These systems must here 
possess the property that, wherever one of the surfaces is cut by one 
of the curves, the intersection is at right angles. The surfaces and 
curves are said to form orthogonal systems. For example, the total 
differential equation xdx + ydy^^zdz^O has an integral which is 
the system of concentric spheres (centre at 0) x^-hy^-hz^ = c. The 

pair of linear equations ^ ^ ^ has been found to integrate to 


y=ax and z = bx, a system of straight lines aU passing through O. 
The lines are the diameters of the spheres and the two systems are 
clearly orthogonal. 


16.6 Other differential equations. 

There are many types of differential equations^ more complicated 
than the simple linear equations so far considered. For example, 
/dy\* dy 

^ ^ ^ non-linear equation involving only a 

first-ofder derivative and its integral (see 16.1) is y*-2aa:+a* = 0, 
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where a is an arbitrary constant. Again, by diflFerentiating the 

relation y=aa: + - twice and eliminating the parameters a and 6, 

dtf 

we obtain the differential equation + ^ - 1/=:0 which 

CLX CLOC 

involves derivatives of the first and second orders. The integral of 
the equation is the relation above and includes two arbitrary 
constants.* The practical methods of integrating such differential 
equations are not easy. The following general problem can be taken 
as an illustration. f 

Ex. The “ dog and hia master ” problem, 

A man stands at a point 0 and his dog at a point A in an open field. 
The man begins to walk at a uniform 
pace along a path at right angles 
to OA, while the dog runs at a 
uniform pace (greater than that of 
the man) always in the direction 
pointing to the position of the man. 

When does the dog catch up with 
the man ? 

Take co-ordinate axes as shown 
in Fig. 100, Oy being the direction of 
the path taken by the man. Choose 
the unit of distance as OA, so that A 
is (1, 0). Let the ratio of the man’s 
speed to that of the dog be u, where 
u<i\. After a certain time, suppose that the dog is at the point P(x, y) 
and the man at Q on Oy, If the path taken by the dog is as shown with 
equation y=f(x), then PQ is the tangent to the path at P with gradient 

* _ OQ-y 

dx MP X 

So OQ=^y-z^£. 

* This suggests an extension of the result of 16.1 above. It can, in fact, be 
shown that, if the nth derivative is that of the highest order in a differential 
equation in two variables, then the integral of the equation is a relation 
between the variables which includes n arbitrary constants. 

t For an account of this and similar problems, see Osgood, Advanced 
Calctdua (1926), p. 332 et aeq, * 



Fio. 100. 
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L»t « =»arc AP of the dog’s path 

(distance travelled by man) 

Now « is a decreasing function of x with negative derivative 


da 1 fdy d^y dy 
dx u \dx dx^ dx, 


')■ 


xd^ 

udx^' 


The increment ds, being compounded of perpendicular increments dx 
and dy along the dog’s path, is given by da^ =^dx^ -{-dy^, 

taking the negative root since the derivative is negative. Hence, 

-V^- -iS- 

The equation, y-f{x), of the dog’s path is thus the integral of 

a differential equation of the second order. To integrate, write 1^=^ 

so that Then 

dx ax* 


i.e. 


I. 


== u f— + constant. 

s/l+p^ 


So 


log(p + n/ 1 +p*) log x + const., i.e. p + s/l -fp*=ax“. 
dy 


Since jp = “ = (gradient of path of dog at .4)==0 when x=»l, we find, 
on substitution, that the constant a = 1. Hence, ' 

X“=-yi 4-p. 

1 _ VITp-p . 


So 

and 

Hence, 


x-« = 


Jl +p (Jl +p*)* -p* 
x«~«-^=2p. 


. +p* -p 


and c y=^^Jx**dx-Jx~“dx^+constant=^ +6. 
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u 

At A, y=0 when x = l, and so we find 6 = ^—^. The equation of the 
dog’s path is thus 

_ u 

^ ^ 2(1 +M) “ 2 ( f-u) 

The dog catches up with the man when a; =0, in which case 


OQ^y=-, 


u 


1 -u^ 


The man walks a certain multiple distance OA before the dog 

catches up with him. For example, if the dog’s speed is twice that of the 
man ('W = i), the equation of the dog’s path is i/ = J - 3 Vx + 2) and the 
man walks only two-thirds of the distance OA before the dog reaches him. 


Similar complications arise in equations which involve the differ- 
entials of three or more variables. Further, when the number of 
variables exceeds two, we can define equations of a rather different 
type involving partial derivatives instead of differentials. The 
simplest form of this typo of equation occurs when the partial 
derivatives are related linearly as in the following example : 


Ex. 


dz dz 


The equation is integrated when we have expressed z explicitly in* 
terms of x and t/. The following trick is needed in the particular case 


considered. 


Write =- so that 

X 




1 / 9z , 




From .the original differential equation, we have * ~ “ 2/ * 


So 


, dzoc di! - y dx 9 - , 
du = ^ d 




dy dy 

Hence the variation of w as a function of x and y depends only on the 
variation of - , and not on the separate variations of x and y. It follows 

3 / 

u • ^ y 

that u is some function of - only, and so 

m 

whore <f> is any function whatever. This is the integral of the given 

P M.A. 
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differential equation. It is to be noticed that the arbitrary element in 
the integral consists, not of an arbitrary constant, but of an arbitrary 
function. This is a special feature of equations in partial derivatives. 

The form of the integral shows that 2 is a linear homogeneous function 
of X and y (see 12.8 above). The integral, in fact, provides the converse 
of Euler’s Theorem : 


dz dz 

If X ^ + y ^=^z at all points (x, y), then 2 is some linear and 
homogeneous function of x and y. 


As an application of the result, suppose that a good X is produced by 
using two factors A and B according to the production function x =^f(a, b) 
and that the product is always divided (without excess or deficit) according 
to the marginal productivity law. Hence, each unit of the factor A 

dx 

receives a return equal to the marginal product ^ and similarly for the 
factor B. It follows that ^ 


a 


dx dx 
db 


-X 


for all combinations of the factors. This is only possible, as our present 
result shows, if the production function is linear and homogeneous. 


16.7 D 3 mamic forms of demand and supply functions. 

, A market for a good X is made up of a competitive group of buyers 
and a competitive group of sellers. Under static conditions, assum- 
ing that the prices of other goods are fixed, the demand of the 
buyers can be represented by a function x = <f)(p) and the supply of 
the seller by a function x=f{p), where p denotes the imiform price 
of the good. A position of static equilibrium is said to be determined 
at a price which equates demand and supply : (l>(p)=f(p)-> This 
is an equation in one variable with a definite number of solutions, 
each corresponding to a possible equilibrium price. 

The static problem implicitly assumes that there is no “ specu- 
lative ” demand or supply and that there is no “ time-lag ” in 
production on the supply side. Our problem can be extended by 
dropping these assumptions in turn. The market price of X is now 
taken as varying over time, the current price being a function of t, 
the time (in years or some other unit) since a fixed base period. We 
Kave^fco find, under various assumed conditions, the form of the 
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price function p(t). The static problem gives a definite price and 
equilibrium position ; the extended “ dynamic ” problem must give 
the course of prices over time and the path of a moving equilibrium. 

The demand of the market (as a rate per unit of time) at the 
current time t is taken as depending on the price p (t) and on certain 
speculative elements, including the buyers’ view of the future 
course of prices. We consider here only the simple case in which 
the buyers’ “ foresight ” is confined to an allowance for the rate at 
which the price is changing at the time concerned, demand depending 
on whether prices are rising or falling and at what rate they are 
changing. We now have demand as dependent on p{t) and its 
derivative p\t) : 

X = <l>{p (t), p'(t)}, the dynamic demand function. 

In the same way, assuming the same kind of speculation (but no 
timedag) on the supply side, we can write 

x=f{p{t)^ p\t)}y the dynamic supply function. 

For equilibrium over time, the price function p {t) must be such that 
demand and supply are equal at all times t : 

This is a differential equation including the first derivative of p 
with respect to and its integral gives ^ as a function of t involving 
one arbitrary constant. The latter can be determined in terms of* 
the price p^ ruling in the base period (^ = 0). Hence, given the 
initial price, the course of prices over time is determined. 

The problem can be illustrated by taking the particular case 
where the demand and supply functions are linear. Under static 
conditions, we write demand as x=^ap-\-b and supply as a: = ap+)3. 
In the*“ normal ” case, the constants a and are negative and 6 and 
a are positive. For equilibrium, demand equals supply, i.e. 

. 6 = ap+j8, 

giving one equilibrium price p = ^ — - • 

oc — a 

In the simple speculative case considered, we write : 

Demand x = ap{t) +cp'{t). 

Supply x = ocp(t)-{-P^ yp'{t)> 

where a, 6, a and jS are constants with the signs indicated above and 
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where c and y are new constants which can be taken as positive. 
This implies that a rising price, other things being equal, stimulates 
both demand and production. It is possible, however, to take y as 
negative if a rising price causes sellers to hold back their supplies. 
For a moving equilibrium, the price function p (t) must be such that 

ap (04-6+ cp'(t) = ap (0 + jS + yp\l), 

i.e. p*(t)=- — ~{p{t)-p} where p = ^ 

a-a 


Write 

Then 


q{t):=p{t)-p 

q\t)=^Xq{i\ 



i.e. 


qdt 


This familiar differential equation has integral q=zAe^*. 

So p{t)=:p + Ae^K 

Putting ^ = 0 and p=Po (the initial price), we find The 

course of prices over time is thus uniquely given by 

p{t)=p-h{po-p)e^*, 


where p = 


oc — a 


, A = 


Qc — a 
c-y 


and Pq is the initial price. 


The important term in the expression for p (t) is This term 
increases and tends to infinity, or decreases and tends to zero, 
according as A is positive or negative. Since we have taken (a - a) 
as positive, the sign of A is governed by that of (c-y). Hence, 

(1) if y<c, A is positive and the price steadily diverges from 
the static equilibrium value p as time goes on. 

(2) If y>c, A is negative and the price steadily approaches 
the static equUibrium value p as time goes on. 

A stable course of prices is thus only possible if y>c, i.e. if the 
speculative element in supply is stronger than that in demand. 

If, in addition to the speculative element, tljere is a time-lag of 
constant length 0 years in production, then the supply of X is 
dependent on the price and the price changes at the earlier time 
(t - 0) rather than at the current time t. The current supply was 
arrarifeed for 6 years ago at the beginning of the production process 
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and was subject to the price then ruling. The general condition for 
moving equilibrium is now 

pm=f{p{t-0). p'it-e)} 
and with linear demand and supply functions 

ap {t)+b+ cp\t) ^oLp(t-e)+^-\- yp'{t - 0). 

Such differential equations can be integrated only with the aid of 
‘‘ complex numbers ”, entities designed for the description of 
variables which oscillate in value. But the general solution is found 
to be very similar to that of one special case which we can easily 
solve.* 

When there is a production time-lag but no speculation, the 
equation for p(t) ceases to be differential and takes the form 
ap (0 + 6 ^ ap {t-0)^- p, 

i.e. 

Here, linear demand and supply fimctions are assumed and p has 
the same value as before. If the course of prices over the initial 
period from t = 0 to t = 9 is given, then the equation above gives the 
price p{t) at any subsequent time t. In fact, measuring all prices 
from the level p, the current price is just a constant multiple of the 

price 6 years ago. The multiple is the negative number - . There* 

a 

are three cases to consider. Suppose, for simplicity, that the prices 
in the period (0, 6) oscillate about p, being as much below p at 
t = 6 as above p at t = 0. Then ; 

(1) If a and a are numerically equal, then the price variation in 
the period (0, 20) completes the oscillation of the period (0, 0), the 
price at t = 26 being the same as that at ^ = 0. In successive periods 
of 20 years, the same cycle of prices is repeated. The course of price 
over time is* illustrated in diagram A of Fig. 101. 

(2) If a>a numerically, then the ratio - is negative and numeri- 

cs 

cally less than unity. The price again oscillates, describing a cycle 

♦ On the problems considered here, see Tinbergen, Utilisation des Equations 
fonctionelleSy Econometrica, 1933, and Theis, A quantitative theory of industrial 
fliLctuations, Journal of Political Economy, 1933. The first work in thi* fielc^ 
is due to Roos and Evans. 
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in every period of 2d years, but now the extent of the oscillation 
diminishes as time goes on. The oscillations are “ damped as 
shown in diagram B of Fig, 101. 

(3) If a<a numerically, the case is similar to the previous one 



except that the oscillations increase in amplitude as time goes on. 
This case of “ explosive ” oscillations in price is illustrated by 
diagram C of Fig. 101. 

16.8 The general theory of consumers’ choice. 

The analysis of the demand of an individual for consumers’ goods 
has proceeded, so far, on the assumption that the individual’s scale 
of preferences can be represented by an indifference map, i.e. by a 
utility function (measurable or not). This assumption can now be 
scrutinised. ' • 

Suppose that a; and y are the individual’s current purchases of 
two goods X and Y. It is now assumed simply that the individual 
has %, definite scale of preferences for small changes in these pur- 
chases. He can distinguish that small increase Jy in his purchase of 
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F which just compensates (i.e. leaves him indifferent) for a given 
small decrease ( - Jx) in his purchase of X. Larger increases than 
Jy are preferred by the individual whereas smaller increases leave 
him in a position worse than before. Hence, for an indifferent 
change, Jy is a definite multiple of Taking the hmit for 

smaller and smaller changes, the assumption defines the differential 
dy as a definite multiple of ( -dx). The multiple (which is positive) 
is defined for each initial set of purchases {x, y) we care to select and 
varies as these purchases vary. The multiple is, in fact, a function 
of the variables x and y and we can write it as R (x, t/). Our assump- 
tion thus gives the differential equation for indifferent changes : 


dy 

dx 


y) 


or R{x, y)dx -\-dy — 0. 


The function R takes only positive values. It is further assumed 
that the scale of preferences is continuous in the sense that R has 
continuous partial derivatives. 

The differential equation above can be integrated in all cases 
(16.3 above) to give the general integral 


F{(f>{x^ y)} =constant. 

Here F is any function and <f){Xy y) is a function such that 

^;= 

In diagrammatic terms, our assumption corresponds to the defini- 

du 

tion of a definite “ indifference direction given by -^—R{x,y 

at each point {x, y) of the plane Oxy. The indifference directions can 
then J)e integrated to give a whole system of indifference curves 
defined by (a;, j/)} = constant. The situation is exactly as 
before but based on a different assumption. We do not assume a 
complete scale of preferences but a scale for small changes from any 
given set of purchases. Finally, the utility function index 


u=F{^{x,y)}, 

with a constant value along any one indifference curve, follows from 
our new assumption. 

From 13.8 above, the function R{x^ y) here defined is to be^inte^ 
preted as the marginal rate of substitution of Y for X, Previously, 



440 


MATHEMATICAL ANALYSTS FOR ECONOmSTS 


we deduced this concept from an assumed utility function. The 
position is now reversed ; the marginal rate of substitution is the 
fundamental concept and the indifference map and utility function 
are deduced from it. This is an important change of approach. 

The general case where any number of goods appear in the con- 
sumer’s budget is sufficiently represented by the case of three goods 
X, T and Z. The assumption to be adopted asserts that the indi- 
vidual has a scale of preferences for small changes from a given set 
of purchases (x, i/, z) so that he can distinguish that small increase 
Jx in the purchase of X which compensates for given small decreases 
and (-^dz) in Y and Z. The individual prefers a larger 
increase than Jx, but rejects a smaller increase. Taking smaller and 
smaller changes and writing differentials for the increments, we have 
dx as a multiple of ( -d^) plus a multiple of ( -dz). The multiples 
are positive and functions of the purchases (x, y, z). So 

dx = Jli (x, y,z)(- dy) + It^ {x, y,z){- dz) 
or dx + {x, y, z) dy + i? 2 2/> z)dz^0 


is the differential equation for indifferent changes in purchases. The 
functions Ri and i ?2 positive values and are assumed to have 
cantinuous partial derivatives. 

The interpretation of iZj and is not difficult. The ratio of 
compensating increments in the purchases of X and Y when the 
purchase of Z remains unaltered can be written 

(-^) =Ri{x,y,z). 

\ ay / j, constant 


The function R^ thus represents, at the purchases {x, y, z), the 
marginal rate of substitution of X for Y, In the same way R^ repre- 
sents the marginal rate of substitution of X for Z, Our assumption 
is equivalent to the definition of two marginal rates of substitution 
as describing the individuars scale of preferences. In diagrammatic 
terms, we have, at each point of Oxyz space, an indifference plane 
given by dx R^dy R^dz^Q and with gradients ( ~i?i) and ( -R^ 
in the Oy and Oz directions. The plane includes' aV the indifferent 
directions of change from the purchases (a;, y, z). 

The differential equation dx^R^dy ^R^dz — ^ need not possess an 
integral (16.3 above). In general, therefore, we cannot integrate 
the set of indifference planes into a complete set of indifference 
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surfaces, and we cannot assume that any utility function exists. 
The assumption of a scale of preferences for small changes of pur- 
chases does not imply that a complete scale of preferences exists. 
The consumer can discriminate between small changes from his 
established purchases but need not be able to discriminate between 
widely different sets of purchases. The assumption of the marginal 
rates of substitution as fundamental has shown up a new possibility. 

The differential equation is integrable only in the special case 
where and i ?2 satisfy the integrability condition (16.3 above) : 


( dR, dRA (dR, dRA 


= 0 . 


In this case wo can write an integral in the form 
F{(f){x .y ,z)} — constant, 

where F is any function and y, z) a function such that 

^=Ri{x,y,z) and y, z). 

9x 9x 

The integral is shown by a system of indifference surfaces touching 
the given indifference planes and the function u = F{(f>{x^ y, z)} can 
be taken as a function index of utility. The given marginal rates 
of substitution are equal to the ratios of the partial derivatives bf 
u in any one of its forms. It must be remembered now that the 
indifference map and utility function are deduced from the marginal 
rates of substitution and not conversely. 

In developing the theory of individual demand for three or more 
goods, therefore, we proceed from the assumption of marginal rates 
of substitution. As a special case, we can then add the integrability 
assumption, take an indifference map and a function index of 
utiht;^ and deduce certain results which hold only in this case. We 
have a general theory and a special (integrability case) theory of 
demand, a «distinction which will be made clear later (19.7-8 below). 
The results of 16.5 above are of direct application to the present 

dx dn dz 

problem. The differential equations ^ define a tangent 

direction at the point {x, y, z) in space, a direction perpendicular to 
the indifference plane given by dx + R^dy +R^dz^0, If the indi- 
. vidual increases his purchases from the given set {x, y, z) in tije wa^ 
indicated by this direction, these are the increases he prefers above 
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all others. The direction can thus be termed the individuars 
'preference direction at {x, y, z)* The values of and It 2 at any 
point thus indicate, not only the marginal rates of substitution for 
indifferent changes, but also the ratios of the increases in purchases 
which are most preferred by the individual. 

Since and R 2 have continuous partial derivatives, the differ- 
ential equations for the preference direction can be integrated to 
give a system of curves in space, the Lines of preference of the indi- 
vidual (16.5 above). One such curve passes through each point of 
space and indicates the most preferred direction of change from the 
point. The complete lines of preference exist even when there is no 
indifference map. But, in the integrability case where indifference 
surfaces exist, it follows that the lines of preference and the in- 
difference surfaces form orthogonal systems. A line of preference 
cuts an indifference surface at right angles and indicates the 
'' quickest ’’ way from one indifference surface to the next, i.e. the 
direction in which the individual’s utility level is increased most 
rapidly. 


EXAMPLES XVI 


Differential equations 


1, Draw a graph showing a sufficient number of the tangent directions 

= - to indicate the curve system which is the integral of the equation. 
y dv V 

Draw a similar graph of ^ ^ • Check by integrating the equations by 


the “ variables separate ” method (see 16.2 above). 


2. Illustrate graphically the differential equation ^ ^ and its 

integral. From the equation, prove that each curve of the integral system 
has a maximum point at the same value of x. 


a 

3. If show that {y - l)dx+ {x- l)dy = 0. Conversely, show 

that the differential equation has the integral shown (using the, “ variables 
separate ’* method). What are the curves represented? 


* Preference directions were introduced by Edgeworth, Mathematical 
Psychics (1881, reprinted 1932) and by Irving Fisher, Mathematical investi- 
gations in the theory of value and prices (1892, reprinted P92’)). In the case of 

two goods, indifference curves are defined by the relation ^ - 22 (a?, t/) and 

dy 1 

preference lines by the relation — = :=r-; : . 

^ dx R(x,y) 


83 jtems of indifference curves and lines of preference, 
the individuars scale of preferences. 


We have, in all cases, orthogonal 
Either system defines 
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4. Show that t/ = a(a; + a) is a certain system of straight lines and eliminate 
a to obtain the differential equation of the system. 

6, Obtain a differential equation of the second order, representing the 
curve system with equation ax* + by*= 1 (a and b parameters). 

6. The equation y = ax* + b represents all parabolas with axis along Oy, 
Find a differential equation to represent the curve system. Of what order 
is the equation? Consider, in the same way, the equation y = ox* + 6x -f c, 
representing all parabolas with axis parallel to Oy. 

7. Derive linear differential equations by eliminating a from the relation 

X 

a- 

a; 4 ^ = oe* and from the relation z — e^, 

8. Find an integral of each of the differential equations : 

x* dx 4 1 /* dt/ = 0 ; 2ydx-\-xdy = (^\ y dx - (x^ - \)dy = 0 ; 2^4 34/2/ = 0. 

9. If a is a given constant, obtain an integral of 

dy <x~x 
dx X 


Show that the curve system obtained can be represented in either of the forms 
= constant and x- cl log x 4 log y = constant. 

10. Show that y — ae^'^ is an integral of = 

1 4 

11. Show that the integral of (t/* - l)da; - 2dy = 0 is t/ = y — — . 


12. Integrate (2a;4t/)da;4 (a:4 2y)dt/ = 0 by showing that the left-hand side 

of the equation is an exact differential. • 

13. Show that ydx-\-(2x + y)dy = 0 becomes exact on multiplying through 

by Sy. Hence integrate the equation. , 


14. Find the partial derivatives of 
equation 


x*4y* 

3xy 


and hence solve the differential 


y(2x* -y*)dx-x(x^ - 2y^)dy~0. 


^ ^ dx dy dz ^ 

15. Integrate — = 0. 

X y z 

16. Show that a differential equation of the form 

* <f>{x)dx+ iliiy)dy + x{z)dz = 0 

is always integrable. How is the integral obtained? 

17. \Vhick of the differential equations 

yzdx + oczdy + xydz = 0; dx - zdy - 2ydz = 0 ; 
zdx-\- (x-\-y)dy + dz = 0 and y{y-\-z)dx + x{x-z)dy-\-x{x + y)dz = 0 
are integrable and;which not? 

18. Integrate 2x dx 4 z dy 4 y dz = 0 by showing that the left-hand side is 
an exact differential. 


19. Find the peurtial derivatives of 


x(y + z) 


x + y + z 

Hence integrate (y 4 «)* da; 4 a;* dy 4 x* dz - 0. 


with respect to x, y^and 
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20. Find the integral of the simultaneous differential equations 

xdx — ydy — z dz. 

Examine this integral and that of example 15 above in diagrammatic terms 
and show that the curve and surface systems concerned are orthogonal. 


Economic applications of differential equations 

21. It is known that the elasticity of a demand law x — <l>{p) is of the form 
(a - bp) where a and b are given constants. By integrating a differential 
equation, show that the law is x = p^e-f>(P-^^\ c being an arbitrary constant. 

22. If n is the total cost of an output x, it is known that marginal cost 
( always equals average cost • Integrate and show tliat 77 is a 
fixed multiple of x, i.e. that average cost is constant. 

23. If y is the number of incomes of it is found that y decreases as 
X increases according to the law ~ = - m- where m is a given constant. 

ax X Q 

Integrate and show that the dependence of y on a; is y = — . 


24. The price of tea, initially 3s. per lb., is p{t) ponce per lb. after t weeks. 

The demand is a; = 1 20 ~ 2p + 5 ^ and the supply a; = 3p - 30 + 60 ^ thousanrl 

lbs. per week. Show that, for demand always to equal supply, the price of tea 
must vary over time according to the law p = 30 + What are the prices 

after 10 and after 60 weeks? Draw a graph to show the approach of the price 
to the equilibrium value of 2s. 6d. per lb. 


'25. The demand (per unit of time) for a good is a; = op 4- 6 and the supply 
(=ap-tp where p is the price. If there is (e.g.) an excess of demand over 
supply in any period, it is assumed that the price changes to decrease the 
excess at a rate proportional to the excess. 

If p (t) is the price at time f, show that the assumption implies that 






dp 


where k is some positive constant. Deduce that p {t) is given by — + ^ (p - p) = 0 
_ 6 — i3 

where p = . If po is the initial price (t = 0), show that p{t)=p + (p© - p) 

(X — a 


and that the price tends to the equilibrium value p. 
Introduction to Economics (1930), p. 48. 


See Evans, JM^athematical 


26. An individual’s preference scale for two goods X and Y is defined 

by the marginal rate of substitution of Y for X, Show that 

w = (x - a)* + (y - 6)* is one form of the utility function. 2 / ~ ^ 


27. If 7? = ^ ig the marginal rate of substitution of Y for X, show that 

011.5 fonii of the individual’s utility function is w = (x4-o)*(y + 6)^ where a, 6, 
(X and P are given constants. 
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ax “f* Ij'ij “f" c 

28. The marginal rate of substitution of Y for X is /? = where 

oa; + ^2/ + y 

the coefficients are given constants. If a = 6 , show that one form of the 
utility function of the individual is quadratic in x and y. If a ^ 6 , show that 
the utility function need not be of this form. Illustrate by considering the 
particular cases of the two previous examples. 

29. Three goods X, Y and Z are said to be “ independent ” in consumption 

if the ratios of the individual’s marginal rates of substitution \ : R^: are 

of the form <j)(x) : ^{y) : x{^)> where <f>, tp and x are functions of the single 
variables named. Show that the integrability condition is always satisfied 
and indicate the form of the utility function. Illustrate when 

1 : : R^ = (x-a) :(y-b): (z-c). 


30. The marginal rates of substitution of X for Y and Z are known to be 
ratios of linear expressions in x, y and z, so that 

(a^x + h^y + c^z ■\-d^)dx->r (a^x + b^y + c,z + d^)dy-\- (a^x + b^y + c^z-^ d^)dz = (i 

is the differential equation of the indifference plane. What is the condition for 
the existence of indifference surfaces? Show that the condition is satisfied 
if 6j = aj,, 03 = 63 and a^ = c^. In this case, show that the utility function can 
be written as a quadratic in x, y and z. 



CHAPTER XVII 


EXPANSIONS, TAYLOR’S SERIES AND HIGHER ORDER 
DIFFERENTIALS 


17.1 Limits and infinite series. 

We have seen (4.1 above) that there are two broad classes of number 
sequences. A sequence either tends or does not tend to a finite 
limit, the second case including sequences tending to infinity and 
sequences which oscillate. For example, 

h h h if. ••• "^1. 

1, 4, 9, 16, ... ->oo . 

The nth members of these sequences are ^1 - and n® respectively 
arid the results can be written in the alternative forms 

” l-i->l as n->oo and n®->oo as n->a) . 

2 " 

Here n represents any positive integer. 

A niunber sequence can be treated in a slightly different way by 
forming from it a second sequence, a sequence of differences. The 
nth member of the new sequence is the difference between the nth 
and the (n - l)th members of the old sequence. The new sequence is 
written down with plus signs between successive members (to indi- 
cate our intention to add them together), continued indefinitely and 
called an infinite series. For example, the two sequences above give 
rise to the infinite series 


111 1 

2 + --f-g-h...-l---h..., 


l + 3-|-6-)-...-|- (2n — 1) + ... p 

v^ere*ihe expression in n represents the nth term of the series in 
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each case (n being a positive integer). From the definition, the sum 
of n terms of the infinite series equals the rath member of the original 
sequence. So, for any positive integral value of n, 

111 1,1 

2 + ^ + g + ...+^„ = l--, 

In general, if /(n) is the nth member of a given sequence, then the 
infinite series of difierences has sum of n terms equal to f(n). 

If the original sequence has a finite limit, then the sum of n terms 
of the derived series has the same limit. The series is said to be 
convergent and the limit of the sum is the sum to infinity. If the 
original sequence has no limit, neither has the sum of n terms of the 
series. The series is said to be divergent and there is no sum to 
infinity. For example, the series first written above is convergent 
with unity as the sum to infinity ; the second series is divergent, 
the sum of n terms tending to infinity with n. 

The limit of any sequence (if it exists) can thus be written as the 
sum to infinity of a convergent infinite series. Any member of the 
sequence and the sum of any number of terras of the series can then 
serve as an approximate value of the limit, a fact which provides one 
of the main uses of a convergent series. If A is the sum to infinity of 
a certain series, then A is given by the limit of the sum of n terms and 
found approximately by adding together a sufficient number of 
terms of the series. This may bo the simplest way of getting an 
approximate value of A if the latter is a complicated expression. 
Again, the sum to infinity provides an approximate value of the sum 
of a large number of terms of the series. The sum of (say) 100 terms 
of the series (I + J-1-8 + ***) is not easily calculated. But we know 
that the sum to infinity is 1 and this can be taken as the approximate 
value of the sum of the 100 terms. 

Conversely, if we are given a certain infinite series, we can form 
a sequence of sums, the nth member of the sequence being the sum 
of n terms of the ^eries. By definition, the given series is convergent 
if the derived sequence tends to a finite limit. 

Two important cases of series can be considered in detail. A 
geometric series consists of terms which increase (or decrease) 
from one term to the next by a given factor, called the 9omrMn 
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ratio. In general, if the first term is a and the common ratio r 
the series is 

a-\-ar + ar^ + ar^ + ... + ■\- . . . . 

It is shown, in elementary algebra, that the sum of n terms is 

(n = l, 2, 3,...). 

1 — r 


The derived sequence is thus 


1-r ’ 


1 -r" 


If r is numerically less than unity, the expression r” becomes smaller 
and tends to zero as n increases indefinitely. Hence, 


^ a a 

S = r” 

^ 1-r 1-r 


as w -> 00 . 


The geometric series is convergent with sum to infinity , pro- 

vided that r is numerically less than one. In other cases, the expression 
r** does not tend to zero as n increases and the geometric series is 
not convergent. The sum of n terms either tends to infinity or 
oscillates. 

Here a = i, r = J and = 1 ---->1 as n-^oo . 

The series is convergent with sum to infinity 1 (as obtained above). 

^ Ex. 2. The recurring decimal Od is a short way of writing the sum 
to infinity of the series + . 

This is a convergent geometric series with a = r = 3^. The sum to infinity 

a \ j 

is ^ = -• Hence, the decimal form of the fraction is 0 * 1 . 

1 r y 

Ex. 3 . An income stream of £a starts next year and continues for 
n years. Reckoning interest yearly at lOOr per cent, per year,* the present 
value of the stream (see 9*7 above) is 

a a a a 

ITr"^(l +r)2^a’+rp'^’”‘''(TTr)P , 


+ r ^ 1 r t \1 +r/ J 
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If the income stream goes on for ever, the present value is obtained by 
letting n->co , i.e. it is the sum to infinity of the convergent geometric 
series above. Hence, the present value of £a per year for ever, at interest 

at lOOr per cent, reckoned yearly, is For example, the present value 

of £3 per year for ever is £100 if interest is reckoned yearly at 3 per cent, 
per year. 

An arithmetic series consists of terms which increase (or decrease) 
from one term to the next by a given amount, called the common 
difference. If the first term is a and the common difference d, the 
series appears 

a (a d) (a 2d) + + — Id) • 

It is easily shown that the sum of n terms is 
*S’„ = |n{2a + (n- l)d}. 

Since *$'^->±00 as n->Go , the arithmetic series is not convergent and 
never possesses a sum to infinity. 

Ex. 1. l+2+3 + ...-f-n + ... . 

Here = in(n + 1) the sum of the first n positive integers. 

Ex. 2. 1+3+5 + .*. + {2n — 1) + ... . 

Here = |n{2 + (n - 1)2} = 71 ^ the sum of the fust n odd positive 
integers. 


17.2 The expansion of a function of one variable (Taylor's series). 
By a process of long division, we find 

1 

= l+ a; + a;2 + ...+a:”+ 

\ ~x \ -X 


If X has a value numerically less than one, the term containing 
decreases and tends to zero as n increases indefinitely. Hence, 


=--l +a; + a;2 + .., +a:” + ... , 

• \ -X 

if X is numerically less than one. We have thus expressed ® 

the sum to infinity of a convergent infinite series. The result can 
be checked in this case. The series written is a geometric series 
with common ratio x (<1) ; the series is convergent with sum to 


infinity equal to (by the formula of 17.1). 

L “ X 
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The advantage of the series expression for is clear. If a; is 

given a small value, then, instead of evaluating t-— by long division, 

we find its approximate value (to any required degree of accuracy) 
by adding a sufficient number of terms of the series. For example, 

if 

OA 1 

~ = + + 0*03333 + 0001 1 1 + 0*00004 + . . . 

29 1 -X 

= 1*03448. 

Hence to four decimal places, ff = 1*0345, as can be verified by long 
division. The fifth term of the series is too small to affect the 
approximation and is not included. Again, if a: is a small number, 
then to a degree of accuracy represented by x^ 


1 

= 1 +a: + a:*, 

1 -a; 

i.e. for approximate purposes the expression — -i— can be replaced 

A ““ a? 

by the simpler quadratic expression (l+x + a;^). 

The problem can be generalised. Given a single-valued fxmction 
/{x), it is desired to find a convergent infinite series, the successive 
terms of which involve increasing powers of x, with f(x) as its sum 
to infinity. In symbols, we wish to write 

f(x)=aQ-hajX-^a 2 X^-i-asX^-j-... (to infinity), 

where the a’s are certain constants. The problem is a little complex 
since we have first to see whether the process indicated can be 
carried out at all, then to determine for what values of x it is 
valid and finally to allot the appropriate values to the constant 
coefficients. 

If the process is possible, the series obtained is called the expansion 
of f(x) as a. power series, i.e. as a series in ascending powers of x. 
The particular case above indicates the uses of the expansion when 
obtained. For a definite value of x, the accurate value of f{x) is 
obtainable only by substituting directly in f{x) or by finding the 
limit of the sum of n terms of the series. But, if approximate results 
suffice, we need only add a hmited number of terms of the expansion. 
K is possible, for example, to replace f{x) approximately by a quad- 
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ratio or cubic in x when x is small, a great simplification i£f(x) is at 
all complicated.* 

It is assumed, in the following, that f(x) has continuous deriva- 
tives up to any desired order at the points concerned. We start 
with a simple case where the problem is really solved in advance but 
which will indicate the more general method. Suppose that 

f{x)=:aQ-\-a^x + a^^ + 4 - . . . + 

a given polynomial of the nth degree in which the coefficients have 
known values. By successive derivation, 

f'{x) =ai 4- 2a2X 4- 4- ... 4- 

/"(a;) = 2 ^ 2 4 - 6 ( 230 ; 4-... 4-n(n~ l)anX^-^, 

/"'(a:) = Gag 4 - ... 4 -n(n - 1 ) (n - 2)af,x^-^, 


/(«)(a;)=n(n - l)(n~2) ...3.2. 

Putting a; = 0, 

/( 0 )=ao, /'( 0 )=ai, /"( 0 )= 2 a 2 , /'"(O) - Gag, , 
and /<^)( 0 )=n(n - l)(n - 2 ) ... 3 . 2 . la,,. 

A convenient algebraic notation can be introduced here. The 
product of the positive integers from 1 to n inclusive is called n 
factorial and written [^. Hence, from the results above, 

«.=/(0), Oi=/'(0), a, = 4r(0), o 3-4/'"(0),... a„=|^/<"’(0)‘ 

The original polynomial can thus be written in the form 

f[x) =/( 0 ) +/'( 0 )a; 4 -/"( 0 ) ^ +/"'( 0 ) ... +/(-)( 0 ) ^ . 

A function which is not a polynomial cannot be expressed exactly 
in this way. But we can always find the difference between a given 
function /(.'{;) and the series written above and determine whether 
the difference is large or small. To be quite general, let n be a fixed 
positive integer and a;=a a fixed value of the variable x. Then, for 
the given functipp jf(a;), write 

/(o +x) =/(a) +f'(a)x+f"(a) ^ + ••• +/<“>(“) ^ +-Kn {x), 

• * This is an extension of the method of approximations to the vf^ue 
a fimotion by a linear expression (see 6.4 above]^ 
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where R„{x) ia the difference, or “remainder”, between f(a+x) 
and the series written. The fundamental result is that 

■Bn ix) =/'"+!> (a + ex) , 

where 0 is a positive fraction. In other words, is the term we 

expect to follow f^^Ha) p except that the value of the (n + l)th 

derivative is taken, not at a, but at some point (not otherwise speci- 
fied) between a and (a + x). 
Without giving a complete 
proof of the result,* we can 
illustrate (in diagrammatic 
terms) the simple but im- 
portant case where n is 
zero. The curve y=f{x) is 
continuous with a tangent 
at all points (Fig. 102). Let 
A and P be points on the 
curve with abscissae a and 
(a + x) and ordinates f{a) 
and/(a + x). The chord AP 

has gradjent 

X 

referred to Ox, Since the curve passes continuously from A to P, it 
is clear that there must bo a point Q on the curve between A and P 
where the tangent is parallel to AP. Let ^ be the abscissa of Q so 
that ^ is between a and (a + x), i.e. = a + 0x for some fractional value 
of 6. The tangent gradient at Q is /'(^) =/'(a + 6x), So 

i<2±^::£5>=/'(«+te), 

i.e. /(a + a:)=/(a)+Po(^) where Po(x) =/'(a + flx)x 

and our result for w = 0 is obtained.f 

♦ The rigid proof of this result is complex. See Hardy, Pure Mathematics 
(3rd Ed, 1921), pp. 262 et seq. 

t The result when n = 0 ia known as the Mean Value Theorem and its 
meaning can be expressed as follows. The mean increment in the function 

/ (x) for the increment in x from a to (a + x) is ^ ^ • As x -► a, this 

c.'atio 4^nds to /'(a). Our result shows that the ratio equals the derivative oi 
/(x) at some point between a and (o + x) even when x has not a small value. 
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Consider, now, the result of letting n take larger and larger 
integral values. The number of terms in the series for f(x) then 
increases, i.e. we are deriving an infinite series. All now turns on 
the hmiting tendency of the “ remainder ’’ Bn{x). If, for a certain 
value of Xy Rn{x) tends to zero as n tends to infinity, then the 
infinite series we obtain is convergent and wo have 

f(a + x)=f(a)+f'(a)x+f''{a)^+f'''(a)^ + ... (to oo ). 

On the other hand, if i?„ (a:) does not tend to zero, then the infinite 
we obtain is not convergent and cannot have f{a + x) as a sum to 
infinity. The broad outline of the solution of the problem of ex- 
panding f{a-{-x) as a power series is now clear. The possibility of 
the expansion turns on the behaviour of i2„(a:) as n->oo . 
may tend to zero for some values of x and not for others. The 
former values of x are those for which expansion is possible. Once 
this has been determined, the actual expansion is the series written 
above. 

Collecting our results : 


The function f{x) has finite and continuous derivatives of alj • 
orders at a; = a. For a given positive integral value of n, 

/(. + *) =/(o) +/•(,.) X +/"(«) + ... +/»>(«) +/l"«>(a + te) , 


where 0 is some positive fraction. Further, 


f(a + x)=f{a) +f'{a)x +f''(a) ... +f’'>{a) + ... , 


for any value of x such that f^^+^^a + Ox) 


^n+l 


n + 1 


-> 0 as n-^co , 


The expansion off{a + x) shown is known as Taylor^ s series. 

To summarise the position, the values of / (x) and its derivatives 
are known at the jp^int x = a. It is then possible, subject to conditions 
named, to express the value of the function at any point (a-\-x) in 
the neighbourhood of x=a as a series in ascending powers of x, the 
coefficients involving only values at x=a. In short, the values o^ 
f(x) and its derivatives at a;=a provide also the value of f{x) ali any 
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neighbouring point. The particular point a;=a, in the neighbour- 
hood of which the expansion holds, can be chosen at will. By 
selecting different points we get different expansions, e.g. 

m =m +f'{o)x+r(o)^+... (a=o), 

f{l+z)=f{l)+r(l)x+f"{l)^ + ... (a = l). 

Finally, it is to be stressed that each expansion holds only for certain 
values of x. As extreme cases, it may be that f(a + x) can be ex- 
panded for no values of x other than x = 0 (the most unfavourable 
case) or that f(a+x) can be expanded for all j&nite values of x (the 
most favourable case). 


17.3 Examples of the expansion of functions. 

The most useful and frequently quoted cases of expansions are 
considered in some detail in the following examples : 


Ex. 1. If f(x) =x^ (r any real number), then 

/'(x)=rx--i, /"(x)=r(r-l)x^-*, ... . 

In general, =r(r - l)(r -2) ... (r + l)x’’“". 

Taylor’s series (a = 1, n = 1) gives 

(1 +xY = l+rx+Xx^, 


where A = ^(r - 1)(1 is finite even when x=0. This gives a 

convenient approximate expression for (l+x)*" when x is small. More 
generally, 


(1 = 1 ... . X* . ... ^r(>-l)(--2)-(r-n.l) ^ ^ 

provided that 


■Rn(») 


r(f-l)(r-2)...(r-n) 
|» + 1 


(1 +0xY-”-^x'^^ 


.r(r - l)(!r - 1) ... (I - 1) (1 


tends to zero as 7i tends to infinity. If x is positive and less than unity, 
then x”+^~>0 and (n + 1) (1 + 0 x)”+^->qo as n->oo . Hence, i2„(x) as written « 
albove must tend to zero. The same result holds, but is more difficult to 



EXPANSIONS, TAYLOR’S SERIES* 455 

prove, if x is negative and numerically less than unity. We thus have the 
binomial expansion. 


If fe 

for any value of x which is numerically less than one. E.g. 


Vl + a; = 1 + Ja 




To obtain an approximate value of such a binomial expression, we have 
only to add a sufficient number of terms of the series. So, 

^/^=v^TA=l+JA-i(*)»+*(A)»+... 

= 1 +0-05-0-00125 +0-0000625 + ... 

= 1 -0488 to four decimal places. 

The fifth term of the series is not needed here. 


Ex. 2. All the derivatives of e* are equal to e*. 
series with a *= 0, 


* 1 ^ ^ 



Hence, by Taylor 


^n+l 

provided that as n->Q 0 . 


The latter is seen to hold for any finite value of x whatever since 
I n + 1 = 7i(n - l)?i(n -2) ...3.2.1 


increases more rapidly than any power as n tends to infinity. The 
series written above thus holds for any finite value of x and it is called the 
exponential series. 

If aJ is small, the exponential e* can be replaced approximately by the 
quadratic (1 +a; + or by the cubic (1 + x + Ix^ + 6^^). The latter gives 
e* to a degree of approximation represented by x^, E.g., if a; =0-1, the 
cubic expression certainly gives c* correct to three places of decimals. 
We must include more terms when the value of x is larger or when greater 
accuracy is required. 

Taking a; = 1, it is foimd that the series 

* • 


e = l+l+^ + |3+. 


^ gives c correct to five decimal places when ten terms are added. Noting 
that each term can be obtained, as a decimal, by dividing the ittevioSs 
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term by 2, 3, 4, ... , the ten terms are easily evaluated each to six decimal 
places as folloTvs : 

e = 2-5 
+ 0-166667 
+0-041667 
+0-008333 
+ 0-001389 
+ 0-000198 
+ 0-000025 
+ 0-000003 
= 2-718282 

l.e. € =2-71828 correct to five decimal places. 


Ex. 3. If f{x) = log a:, then 




/'»=-; 


In general, y(n) (a;) = ( - 1)^-1 1=^ . 

Taylor’s series with a = 1 gives 

provided that R„{x) =(-!)" = < " 1)" (n + l){l +ex)”^^ 

tends to zero as n->oo . This condition (exactly as in Ex. 1) is satisfied 
if X is numerically less than one. The series written above, the logarithmic 
series, is valid for such values of x. 

Here again we can replace log(l+a;) by an approximate quadratic 
or cubic expression if a; is small. The series gives a method of finding 
approximate values of particular (natural) logarithms. E.g. 
log M «log(l +*)=*- J (*)« + i (*)3 - i + ... 

=0-1 -0-005 +0-00033 -0-000025 + ... 

= 0-0953 to four decimal places. 


17.4 The expansion of a function of two or more variables. 

The function z=f{x, y) is assumed to have contipuous and finite 
partial derivatives up to any desired order at (a, 6). 

Write <l>{t)=f[a-\-xt,b-\-yt)—f{UyV) (u^^a-^-xt, v—b-Vyt). 
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=. ( 


dj du BY dv\ 


du^ dt ^ du dvdt) dv dt ^ dv'^ dt) 


BY du BY dv\ 


-x^ - -' i -I 2x7/ - -[ if - 

du~ du du Bv^ 

and so on. In general, we find 

J(n)/A _ J.n + nx"-l U 4- - ~ l-n-S ?/2 

^ Bii-^ Jda-'Bv^ \2 ^ y 


dnf 


3ti’'~'^ Bv^ 


*4“ •• 


H- nxj/^ 


BY 


dx dv 




d^f 

n •{_ 

dv^ ’ 


where the numerical cochicicnts follow the binomial form 
indicated above (17.3, Ex. 1). By Taylor’s series for (/>(/), 


12 fix /n+1 

<i>{t) = <i> (0) + 4 ,' (0) < + f ' (0) ^ + ... + (0) T- + (^^0 

71 


71+1 


Putting < = 1 and substituting the values of (f){t) and its derivatives 
(all evaluated at < = 0, so that u — a and v — b), we have 


/(»+*, (.+y)=/(«,4) + (x|?4!,|)^ 




J'-J 


d'i 


dx 


ax 


"0 

'ir! a, b 


^ f SY , dY dY\ „ , , 

+ IT, r al- +“ » aT^'lT a,-), .+ *■ '*■ »>• 


where 


R — J- + ;nri a;";/ + -H 


' 0+0*, 6+01/ 


d being some positive fraction. The notation adopted here needs a 
httle explanation. The values a and b appearing as suffixes of 
successive t^rms are to be inserted for x and y in the partial deriva- 
tives of f{x, y) but not in the powers of x and y included in the 
terms. Thus 


and similarly for the other terms. Again, in the expression for 
(^» y)i the values a + 6 x and b + 9x are to be substituted for a;^and 
in the (n + l)th order partial derivatives concerned. 
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We have now obtained an extended form of Taylor’s series and 

/(a+*. 4+y)=/{<., 6)+ (»! + !, |)^^ 

provided that y)->0 as n~>oo . 

This is the expansion o{f{a-\-x, 6 -f y) as a series in ascending powers 
of X and y, the coeflScients being the values of the function and its 
partial derivatives at the point (a, 6). The expansion holds only for 
those values of x and y which make y )-->0 as n-^oo . It is to 
be noticed that the terms of the expansion are grouped so that each 
contains all powers and products of x and y of a certain degree, the 
first term being linear in x and y, the second term quadratic and 
so on. Successive terms are variations of higher and higher 
orders. 

The use of Taylor’s series in expanding and finding approximate 
values of given functions is exactly as before. If x and y are small, 
then /(a + ic, 6-fy) can be replaced approximately by a quadratic 
or cubic expression in x and y by retaining only the first few terms 
of the series and the complex later terms are not needed. 


Ex. All the partial derivatives of are equal to 
Fixing the point (0, 0) and applying Taylor’s series, 


e»+» = l +(x+y)+|^(a:*+2a;y+y*) + |^(x»+3a:*y + 3a:y*+y®) + ... , 


provided that R. (x, y) -- e® <»+■' >-> 0 as n- 

n + 1 


•-00 . 


The condition holds for all values of x and y (as in 17.3, Ex. 2). So 


c*+» = l+(x+j/)+-^(x+y)*+-^(x+y)» + ... , 


which agrees with the ordinary exponential expansion on substituting 
(a; +y) for the index. * ^ 


Taylor’s series can be extended to cases where more than two 
variables appear. As the number of the variables increases, each 
term of the series involves more and more powers and products. 
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In general, if /(a^, **, ... a:„) is a function of n variables and if 
(Oj, a„ ... o„) is a fixed point, then 
/(Oi + Xi, a^+x^, ... a„+a:„) 


“ “■> + ^ sf, + - +*■ S') 


’ ax,* 


OiOt ... 

dH 

^ + 2xiXgg^^ 


•f- _ “h ••• “f* 1 "j"..*! 

aa;i 3:^3 ^ ” dx^_^ ... «« 

subject to conditions similar to those already given. Our results are 

thus perfectly general and the only diflBculty is the labour involved 

in writing the appropriate series. 


17.5 A complete criterion for maximum and minimum values. 

In addition to its practical use in the expansion of functions, 
Taylor’s series is a powerful theoretical tool of service in such 
problems as that of maximum and minimum values. We can, first, 
complete the tests for extreme values of a function of one variable 
(8.2, 8.3 and 8.6 above) and then, in a later chapter, use similar 
methods to develop tests for functions of several variables. 

A convenient form of the definition of extreme values is : 

# 

The function f(x) has a maximum (minimum) value at a;=aif 
f(a-\~x) -/(a) is negative (positive) for all small values of x. 


It is again assumed that the function has finite and continuous 
derivatives up to any desired order at x=a. Taylor’s series gives 

f{a +a;) -fia) =f'(a)x +f"{a)-^ + ... +/(")(a) + dx) 


n + l 


for any fixed positive integral value of n. This form of the series 
is clearly oi^ direct application to our problem. 


(1) Suppose /'(a) v^O. 

By the continiyty of the derivative, /'(a + Ox) has the same sign 
as /'(a) for a certain range of small (positive and negative) values 
of x. Now 


f(a-\-x)-f{a)-f\a-\-Ox)x (n = 0) 
and so f{(i-\-x)-f{a) must change sign as x changes sign. The 
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point x=a cannot, therefore, give an extreme value oif(x). Hence, 
if a:=a gives an extreme value of f(x), we must have/'(o)=0. 

(2) Suppose /'(a) = 0, /"(o)v^O. 

As before, /"(a + has the sign of/" (a) for a certain range of 
positive and negative values of x. But 


/(a + x) -f(a) = f'{a) x +/" (a + dx) ^ )> 


i.e. f{a + x) -f (a) has the sign of f''(a + Ox), i.e. of /"(a). Hence, 
lf/'(a)=0 and/"(a)<0, x=a gives a maximum value of /(a;). 
If/'(a) = 0 and/"(a)>0, x=a gives a minimum value of f{x). 


(3) Suppose /'(o) =/"(a) =0, /'"(a) 

Again f'''{a + 6x) has the sign of f'”(a) for sufficiently small 
positive and negative values of x. So 

f{a+x) - f{a) =r{a)x+r{a) g +/"'(« + Ox) ~ 

=-\f"'{a + ex)x^ (n = 2) 


shows that f{a'\-x) -f{a) changes sign as x changes sign. The point 
which is a stationary point, is thus a point of inflexion and 
does not give an extreme value off{x). 


• (4) Suppose/'(a)=/"(a)=...=/<"-i>(a)=0, 

Taylor’s series can now be written 

f{a+x) -f{a) =/<”>(a + 6x)^- 

Of the two terms on the right-hand side, the derivative has a constant 
sign, that of for sufficiently small (positive or negative) 

values of x. The other term changes sign witli a: if ti is an odd 

integer and is always positive if n is an even integer. In' the former 
case, /(a + a:) ~/(a) changes sign as x changes sign and x=a must be 
a point of inflexion. In the latter case, f{ai-x) -f{a) has a constant 
sign and x=a gives a maximum or minimum valtlie«of/(a:) according 
08 (a) is negative or positive. Hence, 


COMPLETE CRITERION FOR MAXIMUM AND MINIMUM VALUES 

^ • • 

(1) l{f{x) has an extreme value at x—a, then /'(a) = 0. 
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(2) If /'(a) =/» = ... =/(-i){a)=:0, /<->(a)^0, then f{x) 
has a stationary value at a:=a which is an inflexional value if 
n is odd, a maximum value if is even and /^”^(a)<0 and a 
minimum value if n is even and /^”Ha)> 0. 


Subject to the condition that the derivatives involved are finite 
and continuous, the criterion is complete and so both necessary and 
sufficient. There is no case of failure ; unless the function is a 
constant (and so without maxima or minima) there must always be 
some derivative which is not zero. The order and sign of this 
derivative determines the nature of the point considered. The 
practical method of finding maximum and minimum values (8.4 
above) is scarcely aflFected ; a few doubtful cases (such as that of the 
following example) are cleared up and that is ail. 

Ex. 

So |-4(.-l)', g-12(«-l)-, g-24(*-l) and g.24. 

There is only one stationary value, y = 0, occurring at a; = 1 . At this point, 
the first three derivatives are zero and the fourth derivative is positive.. 
Hence, y — 0 is a minimum value of the function. This can be checked 
since y is positive for all values of x except that it is zero when a; = 1. A 
graph of the function shows a curve very similar in shape to the parabolS, 
and with vertex at (1, 0) on the axis Ox, 


17.6 Second and higher order differentials. 

The differential of a function of several variables describes the 
“ first»-order ” variation of the function. But, just as derivatives 
of various orders are needed to describe completely the variation of 
a function ,of one variable, so something more than the ordinary 
differential is required in the case of a function of two or more 
variables. The need is met by the definition of differentials of the 
second and higher, orders. In the following, the definitions and 
results are given only for a function, z=f{x, y), of two variables, but 
they are easily extended to more general functions. 

The function z=f{x, y) is assumed to have finite and continuous 
^partial derivatives of aU orders required. The first-order vafiati(5h 
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dz 

is described by + whether the variables x and v are 

dx By 

independent or not. It is very important, however, to distinguish 
carefully between the two cases. If x and y are independent, then 
Bz Bz 

and — are functions of these variables but dx and dy are arbitrary 
Bx By 

and constant increments in the variables. But, if x and y are de- 

Bz Bz 

pendent on other variables, then — and — are also functions of the 
^ Bx By 

genuine independent variables and dx and dy are expressed in terms 
of the same variables plus their arbitrary and constant increments. 
In either case, dz involves functions of the independent variables 
(whether they are x and y or others) and certain constants, the 
arbitrary increments in the independent variables. The differential 
of dZy as a function of certain independent variables, can now be 
defined in the ordinary way and is called the second differential : 


d^z—d{dz). 

The process can be extended and, in general, the nth differential of 
z is defined as the differential of the (n - l)th differential : 

d^z=d(d^-^z). 

.4s in the case of derivatives, however, it is seldom that we need 
differentials of higher order than the second. 

Our problem now is to express higher order differentials in terms 
of successive partial derivatives of the function. The importance of 
distinguishing the independent variables becomes evident here. 
Unlike the expression for dz, which is the same in all cases, our 
results for the higher order differentials are different and sampler 
when X and y are the independent variables than they are in other 
cases. The reason for this is not far to seek. In differentiating dz 
to obtain d^z, we must know how to treat the dx and dy appearing 
in dz. If X and y are independent variables, dx and dy are constants 
and do not worry us. In other cases, this is not, so and due accoimt 
must be taken of the variation of dx and dy m the differentiation 
process. Before beginning to differentiate, therefore, it is essential 
to know whether x and y are independent or dependent 
fariables. * 
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17.7 Differentials of a function of t'wo independent variables. 

Differentiating the first differential of a function of two inde- 
pendent variables, we have 



The essential point is that dx and dy are treated throughout as 
constant increments ; otherwise the ordinary rules of differentiation 
(13.3 above) are used. It appears, therefore, that d^z is a quadratic 
expression in dx and dy, the coefiicients being given by the second- 
order partial derivatives of z at the point (a;, y) in question. 

In finding second differentials in practice, we can either evaluate 
the second-order partial derivatives and use the above result, or we 
can differentiate twice by rule in the particular case considered. 
The latter method usually involves less labour and is adopted in 
the following examples. In each case, however, the results can be 
checked by means of the partial derivatives. 


Ex. 1. 2 = a;* + 2a:y - y*. 

Here dz = 2(x -\-y)dx ^2(x -y)dy 

and dH ^2d{x -^y)dx + 2d{x -y)dy = 2 {(dx + dy) dx^-(dx- dy) dy) 
= 2 (dx^ + 2dx dy - dy"^). 


Ex. 2. 2 =— 

. x-y + \ 

Here dz = + 


and d*z 


{d (x* - 2xy + 2x)dx + d (x^)dy) (x - y + 1 )* 

_ —{( 3 ^ -2xy + 2x)dx +x^ dy)d(x -y + \)* 

T’i (a:-y + l)* 

2(x - y + -2{(x* -2a:y + 2x)da: +x^dy] (dx-dy) 

- (a;-y + l)» 

{{y-l)dx-xdy}* 

(x-y + U* ■ 
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Ex. 3. z=^\og{x^ + 'if). 


j ji. „d(a:da:+y<i2/)(x* + y*) -rf(a:*+ 1/*) (*&+«/ (/y) 

(x^+y^f 

^ (dx^ 4 - drf) [x^ - 2(xdx + 'y dy)"^ 

(x2+t/2)2 

^ (x^ - y^) dx^ 4 - 4:xy dx dy - (x^ - if) dif 

(x^ 4 - 2 / 2)2 

The process of differentiation can be repeated to give 


dh = d 


i.e. = ^3 dx^ + 3 da;* dy + 3 ^^3 dx dy* + ^3 dy*. 

This is a cubic expression in dx and dy with coefficients given by the 
. third-order partial derivatives of z. Higher order differentials are 
obtained in exactly the same way and it is to be noticed that the 
/lumerical coefficients of successive terms follow the “ binomial 
law (17, 3, Ex. 1 , above). 

We can note here a connection between successive differentials of 
a function of independent variables and the corresponding terms of 
the Taylor expansion of the function. Assuming that Taylor’s series 
• is valid, it can now be written in the form 

d^z d^z 

Az=f(x+h, y + k)-f{x, y)=dz + -r- + j- + ... , -* 

^ [3 

where the differentials of z are to be taken at {x, y) and with respect 
to arbitrary increments dx = h and dy = k in the variables. Hence, 
the increment Az in the function for increments h and k in the 
variables from {x, y) is compounded of the successive differentials, 
dZy dHy dhy ... , of the function at {x, y). The complete variation of 
the function is described only by using differentials of all orders. But, 
the successive differentials involve higher powers of the increments 
Jl an(f I;, i.e. they are of higher order of “ smallness ” if A and k ard 


dx^ dy 


dy) dy*. 
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small. For approximate results, only a few differentials (i.e. a few 
terms of Taylor’s series) are needed to describe the variation of the 
function. As a first approximation Az=^dz\ as a second approxi- 
mation Az—dz-^\d^z \ and so on. 

The higher order differentials of a function of a single variable, 
y=f(x)y appear as a special case of the general process of differen- 
tiation described above : 

dy=f(x)dx, d^=f"(x)dx\ dhj=f"(x)dx\ .... 

In general, d^y = dx^. 

The first derivative, £is we have seen, can be interpreted as the 
ratio of the differential dy to the differential dx. This result now 
extends. The second derivative is the ratio of d^y to dx^, i.e. the 
ratio of the second differential of the dependent variable y to the 
square of the differential (or arbitrary increment) of the independent 

d^y 

variable x. The notation for the second derivative is thus 
justified. Similar results hold for higher order derivatives. 


17.8 Differentials of a function of two dependent variables. 

In the function z=f{x, y), the variables x and y are dependent on 
a set of independent variables u, v, w, ... . The process of obtainmg 
the second differential d^z now proceeds : 


d’z^d + =d (g*^) +<1 (!<!») 
='‘ (|) dy+p/¥y) 




d^x + 



i.e. d-z = g dx- + 2^dxdy + g dy- + 1 + £ d-y. 


dy 


The expression for d^z now includes two additional terms, the terms 
in the second differentials of x and y as functions of the independent 
variables u, v, w, ^.... If x and y happen to be independent variables, 
these terms are zero (since dx and dy are constants) and the expres- 
sion for d^z reduces to that previously obtained. In general, however, 
d^x and must be retained and interpreted in the light of tl^ 
particular problem considered. 

a M.A. 
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Two simple applications of the result can be noted. If z=f(Xy y) 
where z and y are functions of a single variable the partial deriva- 
tives and derivatives of the fimctions being known, then from 

= ^ dx-^~ dyy we derive (13.4 above) : 
ox oy 

dz _ dz dx dz dy 
dt dx dt ^ dy dl 

3^2 d‘^z d^z dz d^ 

From d^z = ^ dx^ -f- 2 dxdy + ^ dy^ + 3 ,- d^x 4 - ~ d^y, 

dx^ dx dy dy^ dx oy 

we derive 

d'^z ^ d^z / <ia;\ * dH dx dy d^z / dy\ ^ dz d^z dz dhj 
dt^ dx^ \dt) ^ dxdy dt dt dy’^ \di) ^ dx dt^ ^ dy dt^ 

The ratio of d^z to dt^ is to be interpreted as the second derivative 
of 2 ; as a function of t (defined by means of x and y). We have thus 
a simple formula for this derivative in terms of the partial derivatives 
of z and the derivatives of x and y. The same result is obtained by 
dz 

taking as a function of t and by finding its derivative directly. 
(tt 


In particular, if z=f{x, y) where y is a function of x, 

dz 


and 


_ dz dz dy 
dx dx dy dx 
dH 


dH __ dH 2 
dx^ ~~ dx'^ ^ dx dy dx 


dy dH ldy\ 
dx dy^ \dx) 


* ^ d^y 


dy dx^ 


In this case, d^x does not appear in d^z since x is the independent 
variable. 

If t/ is an implicit function of x defined by the relation f{x, y) = 0 , 
we have seen (13.5 above) that 

f^dx-^fydy = 0y 

j f 

which gives the derivative Differentiating again, 

dx Jy 

d{f^dx-^fydy) = 0, 

i.e. /a.* dx^ + 2f„y dx dy -\-fyy dy^ +fy dhj = 0, 

noting that d’^y, but not d^x^ appears since y is taken as a function 

of x. Dividing through by dx^^ 


\n. 


f +2r ^+f f^^V+z-^-o 

--l!f 4 - 2 /- + f 

- +/.. I . 


d^y 

dx^ 
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This gives the second derivative of y as a function of x. If the value 
d'u 

of previously obtained is substituted, the result already written 

CiOC 

(13.5 above) is again obtained. 

If the same relation is regarded as giving a; as a function of y, 

dx f 

then fg.dx-\-fydy = 0 gives the first derivative and from 

d {ffg dx + fy dy) ~ 0, we find ^ •' * 

^ XX dx^ + 2f^y dx dy +fyy dy^ +/* d^^x = 0, 
which gives the second derivative, 

d^x _ r dx ( 

~ ~Tx V*’' dy “ Kdy) ] ■ 

dx 

This can be expanded, as before, on substituting the value of -j - . 

dy dx 

It is to be noted that, though the first derivatives and — are 

dx dy 

reciprocal, the same relation does not hold between the second 
derivatives. 

In practice, the derivatives of implicit functions are best obtained 
by differentiating the relation between the variables as it stands, 
always remembering which of the variables is dependent : 

Ex. 1. a:*+y® = 16. 

Taking y as a double-valued function of Xy we have 


2* + 2y|=0. i.e. 1 = 


Differentiating again. 


2+2^$' 


IHtf] 


I Ix^+y* 


But, taking i aa a double-valued function of y, 

u.|=- 

»d 2(|)’ + 2»^^ + 2.0, 


iy^ *1 ^^dy) j x : 



468 


MATHEMATICAL ANALYSIS FOR ECONOMISTS 


Ex. 2. a^+y^-ixy=0. 

Regarding y as a, triple-valued function of x. 




So 




dy 


r*-« 




Differentiating again, 


giving 


d^y 

dx^ 




(y^-xf 

2xy 


{y {x* - yY + {x^ - y)(y'^ - x) + a: (y* - x)*) 
(x»-hy»-3xy + l)= -- 


{y^-xY' 

Similarly, if a; is regarded as a function of y, we find 

dx y^-x , d^x 2xy 
dy' 


y^-x ^ d^x _ 


x^-y 


Returning to the general case where z=f{x, y) and x and y are 
functions of certain independent variables, the third differential 
d *2 can be obtained by differentiating the second differential dH i 

d<z=d[^,d,:’ + t^d^dy + „dy‘ + ^d’l + ^d-y) 




dx^dy 

(S'** +5;''!') 


\dx^ 

+ 3 

dz dz 

+ — + 
dx dy 


dy^ 

d^z 


70 « I 7 

d^x + 3[ ^ dx + 

dx dy dy^ 


dy^ d^y 


on reduction. The expression of the differential has now become 
very cumbersome. Higher order differentials are obtained by further 
differentiation and are exceedingly involved. 

As a particular case of the above results, let y=f{x) where x is 
dependent on other variables. Then 

dy= f\x) dx, d^y = f*\x) dx^ +f\x) d^x. 


• • • • 
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For example, if a; is a function of a single variable t. 


^ = f'M ^ ^ - r(x) l~Y+ f'(x\ 


dt^ ’ 


469 ' 


This is the ordinary function of a function rule for functions of one 
variable, extended to the second and higher order derivatives. 


EXAMPLES XVII 
Infinite series 

1 . Write down the sequence of numbers with nth member T---; — ^ — ) 

V2 n+1/ 

and obtain the infinite series of which the sum of n terms is given by this 
expression. Show that the series is convergent to 

2. Find the infinite series with sum of n terms given by (a) n*(n+ 1) and 
(6) Jn(n+ l)(2n4- 1). Show that neither series is convergent. 


3. Write the series with sum of n terms — and show that it is convergent. 

4 . It is known that tt is the sum to infinity of the series 

Find the value of rt to five decimal places. 

6. Show that *y2 is the sum to infinity of the binomial series 

+ (cf. 17, 3, Ex. 1.) 

and deduce the value of js/ 2 correct to four decimal places. 

6. Express the recurring decimals 013 and 0*27 as fractions. • 


7 . Write down the sum to infinity of each of the convergent series 


4 1 1 


1 2 . 


32 


2+V2+l + ^ + i + 22 + -- 


Illustrate the approximation to the sum to infinity by finding the sum of ten 
terms in each case. 


8. ball is dropped from height h feet on a hard floor, bounces, falls, 
bounces again, and so on. Each bounce is to height e times the previous fall, 
e being a positive fraction. Show that the distance travelled by the ball 
before coming to rest can be represented as the sum to infinity of an infinite 
geometric series. If the distance is 2/i, show that e = J. 

9. An income stream, starting with £a next year, is such that each year’s 
income is half that of the previous year. Adding interest yearly at lOOr per 
cent, per year, find fh^ present value of the stream flowing for n years, and of 
the stream flowing for ever. 

10 . An income stream starts next year with £a and flows for ever so that 
each year’s income is a fixed percentage (100a per cent.) less than the previous 
gear’s income. Find the present value of the stream, adding interest yoai^y 
at lOOr per cent, per year. 
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Expansions 

11. If d is some positive fraction, use Taylor’s series (n = 0) to show that 

log ( 1 + a?) = and deduce that 

^ < log ( 1 + *)< * (x> 0). 

Then, from Taylor’s series (n= 1), show that 

2|j^p^<a:-log(l+x)<ia:* (x>0). 

12. For values of x numerically less than one, show tliat 

7 i ~ ~~ T « = ^ 3x*+ 4a;*-f ... . 

(!-*)• 

13. Write s/6 = W25- 1 and use a binomial series to calculate its value 
correct to five decimal places. 

14. Find a quadratic expression which approximates to — — _ when « is 

vl 

small. Put X = 8^0 Q-iid v'S correct to four decimal places. 

15. If a; < 1 numerically, write expansions for Vl + x, s/l -x and n/1 - x*. 
By algebraic multiplication of infinite series, verify that the expansion of a 
function which is the product of two parts is the product of the expansions of 
the separate parts. 

16. Form an infinite series each term of which is the integral of the corre- 
sponding term of the expansion of (1 in ascending powers of x. What 
is the series? Show that its sum to infinity is the integral of (1 + xj^K What 
general rule does this result suggest? 

• 

17. By taking sufficient terms of the appropriate series, find the values of 
and - correct to four decimal places. 

18. Show that log(n+l) = logn + (i-li + il-ll+...). 

Given log 2 = 0-6931, find values of the natural logarithms of 3, 4 and 6 
correct to three decimal places. (Note : log 4 can be found without using 
the infinite series.) 

19. If interest at lOOr per cent, compounded yearly is equivalent to interest 
at 100a per cent, compounded n times a year (cf. Examples IX, 26), show that 

r exceeds s by approximately “2™ 

20. If interest at lOOr per cent, compounded yearly and at lOOp per cent, 
compounded continuously are equivalent, show that r - J) =1 Jr* approximately 
when r is small (cf. Examples IX, 26). 

21. Find the amount of £1 after n years at lOOr per cent, interest which is 
(a) simple, (6) compounded yearly, and (c) compounded continuously. If 

is ^all enough to be neglected, show that the last amount exceeds the • 
first by J (rn)* and the second by jr^n approximately. 
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22. A man’s income is £x and his (measurable) utility is then ^(a?) where 
(l>{x) increases at a decreaising rate as x increases. The man makes a fair bet 
on an event with chance p, laying £pa against £ ( 1 - p)a that the event happens. 
Show that his expectation of utility is now £{p<j> (a; + 1 - pa) -f ( 1 - p) (a; - pa ) }. 
If the bet is small, use Taylor’s series to show that the expectation gives less 
utility than the original income £x, (See Marshall, Principles of Economics, 
8th Ed. 1927, p. 843.) 


23. By evaluating derivatives and using Taylor’s series, show that 
log J(1 + e®) = ia; + ia:*- + , 
assuming that the series is convergent. 


24. Expand, by Taylor’s series for a function of two variables, the ex- 
pression ^ ^ as a series in ascending powers of x and y. Deduce linear 

and quadratic approximations to this expression when x and y are both small. 
(Cf. Examples XIII, 7.) 


Higher order differentials 

26. Show that neither y — (x- 1)* nor y={x- 1)® has a maximum or mini- 
mum value and that each has an inflexional point at (1, 0). 


28. Find the partial derivatives, and hence the first two differentials, of 

sach of z==x* •\-2xy - 1 /*, z = r and « = log(a5* + y*). 

x-y+l 


27. Find the second differential of each of z = x* -f- y* - 3xy, z = \/x* + y*, 

X 

2 = and z = log —— where x and y are independent variables. 

28. If X, y and z are independent variables, find the second differential of 
u = (x* + 2xy - y*)e*. 

1 * 

29. If 2 = whore x = e* and y = e“*, evaluate - - • 

x + y dt* 

30. Given that x* + y* - 3xy = 0, use the derivatives of y as a function of 
X and of X as a function of y to show that y has a single maximum value 
for variation of x and conversely. Illustrate diag^ammaticall 3 ^ (See Example 
XIII, 22.) 

31. The relation /(x, y, z) = 0 gives z as a function of x and y. Differentiate 
the relation and obtain the first and second order partial derivatives of z in 
terms of those of /. Illustrate by taking the relation x* + y* + 2* - 3xy = 0. 
(See Examples XIII, 23.) 


32. If z=f(x, y) where x and y are given functions of the independent 
variables u and v, show that 

a*z _ 0*z /dx\* o dx ^ a*z az a*x az a»y 

du* ax* \du/ dxdy du du^ dy* \duJ dx du* dy du* 
and similar results for the other two partial derivatives of the second order. 


33. From the results of the previous example, find the second-order partial 
derivatives of z = log(x~y) where ^ = ~ Express z explicitly in 

terms of u and v and find the partial derivatives directly. 
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DETERMINANTS, LINEAR EQUATIONS AND 
QUADRATIC FORMS 

18.1 The general notion of a determinant. 

Despite its somewhat terrifying name, a “ determinant ” is a 
mathematical tool of a very ordinary kind and involves no new 
ideas of any description. Briefly, a determinant is a notation that is 
found convenient in handling certain involved, but essentially com- 
monplace, algebraic processes. Certain expressions of a common 
form appear in algebraic problems such as that of the solution of 
^ linear equations, expressions consisting of sums or difierences of a 
1 number of terms each of which is the product of a number of quan- 
tities. The expressions {ab - and {ahc -af^-bg^- ch^ -f 2fgh) are 
cases in point. Quite apart from other considerations, the labour of 
writing out the more complicated of these expressions is severe and 
there is every reason to welcome a compact and general notation 
for them. The determinant notation is justified on these grounds 
alone. 

There is, however, more in the determinant notation than this. 
Once the notation is introduced, the expressions denoted by deter- 
minants are seen to obey quite simple rules and the algebraic pro- 
cesses in which they appear become simpler and more uniform* than 
before. As a consequence, results can be established which would 
almost certainly be missed were it not for the new notation. It is 
for such reasons that determinants have become of general use ; no 
notation can be retained unless it saves labour and enables us to 
carry out processes more easily and with greater, generality than 
before. 

The foundation of a determinant is a square “ block ” of quantities 
^tten down in rows and columns in “ crossword ” form. Th^re is 
no restiiction on the quantities except that each must be capable of 
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taking a single numerical value ; they can be numbers, constants, 
variables, functions, derivatives of functions, and so on. The number 
of the quantities must be a perfect square ; four quantities can be 
written down in a square form of two rows and two columns, nine 
quantities in a square form of three rows and three columns, and 
so on. Examples are provided by the following arrangements : 

du du xy xz 

^ "“A 'TT' 

dxdy 

’ dv dv y y^ 

OX dy xz yz 

A determinant of two rows and columns is called a determinant of 
the second order, of three rows and columns a determinant of the 
th rd order, and so on. In general, a determinant of the nth order 
as n* quantities arranged in n rows and n columns. The quantities 
themselves are called the elements of the determinant and we can 
speak of the elements of the first row or column, of the second row 
or column, and so on. Each element is allotted, of course, to one row 
and to one column. 

A general method of denoting the block of quantities making up 
a determinant is required. From the point of view of stressing the 
arrangement in rows and columns, the best notation is 


O^J 

^12 

0^13 ... 

^In 


®22 

^23 

®2n 

^nl 

^n2 

(7„3 



where the integer n denotes the order of the determinant. The two 
suffixes of an element denote the row and the column into which the 
element is to be placed. Thus, is the element to be inserted in 
the ‘‘ ftell at the intersection of the fifth row and third column. It 
must be noted, however, that the notation is designed for deahng 
with determinants in general ; in any particular case the elements 
appear as actual numbers or quantities of one kind or another. 

18.2 The definitioi]^ Qf determinants of various orders. 

A good working definition of determinants can be given in suc- 
cessive stages as follows.* A determinant of the second order is 

^ • For a more strict and general definition, see Netto, Die Determyiayitt)gi 

(2nd Ed. 1926), pp. 8-13. 
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defined as taking its value from the ‘‘ cross multiplication ** of the 
four elements which compose it and we write 


Ull ^2 — % 1®22 

U21 ®22 

The determinant on the left is simply another way of writing the 
algebraic expression on the right. 


Ex. 1. 
Ex. 2. 
Ex. 3. 
Ex. 4. 


= 2x3-(-l)x0=6. 

=a x6 - A X A = a6 - A*. 

= X 1/2 - 2xy x 2xy » - Zx^y^, 


2 -1 
0 3 

a h 
h b 

2xy 

2xy t /2 

du du ^du dv du dv 
dx dy dx dy dy dx * 

dv dv 
dx dy 

where u and v are two given functions of x and y. This determinant, 
which is often written g 

d(Xyy)^ 

is termed the Jacobian of u and v with respect to x and y. 


• A determinant of the third order is defined in terms of those of the 
second order by the rule : 


Oil 

(h2 

(ha 

— hi 

^22 

^23 

— ha 

^21 ^23 

+ ^13 

0^21 ®22 

U 21 

®22 

®23 


U 32 

^33 


®31 ®33 


^31 ^32 

®81 

^32 

®33 









— ®11 (^22^33 ~ ^23^31 ) ~ ®12 (^21^33 ®23®3l) 

4- (^21^32 ~ ®22^3l) 


— ^ 11 ^ 22 ^ 83 't' % 2 ® 23®31 ® 13 ^ 21®32 

• r< 

— U22U23U32 ^ <^ 2 ^ 21^33 ^ 13 ^ 22 ^ 31 * 

The second-order determinant multiplying is derived from the 
original determinant by omitting the first row a^i^^ the first column 
(the row and column intersecting in a^), and similarly for the other 
determinants. The third-order determinant is thus seen to be a 

t rt way of writing an algebraic sum of six terms, each term being 
product of three elements of the determinant so chosen that one 
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element comes from each row and one from each column. Further, 
half the terms are added and half subtracted to form the algebraic 
sum. 


Ex. 1. 


Ex. 2. 


Ex. 3. 


Ex. 4. 


1 3 

-1 0 
0 -2 


-2 

3 

1 


= 

0 3 

-3 

-1 3 

-2 

o 

1 


-2 1 


0 1 


0 -2 


0 a 

p 

— — OL 

a a 

h 


j8 h 

b 


a h 

9 

= a 

h b 

/ 


9 f 

c 

— abc 

du 

du 

du 

dx 

dy 

dz 

dv 

dv 

dv 

dx 

dy 

dz 

dw 

dw 

dw 

dx 

dy 

dz 


= (0+6) -3(-l -0)-2(2-0)=6+3-4=5. 

2h<xP - boL^ - aj8*. 


OL h 

+ P 

a a 

P b 


/3 h 


b f 
f c 




br/-^ch^+2fgh. 


du dv div du dv dw du dv dw 

dx dy dz ^ dy dz dx ^ dz dx dy 

du dv dw du dv dw du dv dw 

dx dz dy dy dx dz dz dy dx 


This is the Jacobian, 


d(u, v, w) 
d(x, y, z) 


, of lA, v and w as functions of x, y and z. 


A determinant of the fourth order is then defined : 



Ul2 

^13 

Ul 4 

— ttjj 

^22 

(f 23 

^24 

O'] 2 

O 2 I 

^23 

^24 

aj, 

^22 

^23 

U24 


U32 

^^33 

^34 


O 31 

^33 

«34 

«31 

U32 

^33 

^34 


«42 

«43 

U44 


U44 

^43 

U44 

^41 

• 

«42 

®48 

«44 

+ «13 

«21 

«23 

Cc24 


^21 

^22 

^23 






^31 

^32 

^34 


^31 

O 32 

^33 


• 




^41 

«42 

U44 


^44 

U42 

^43 


This is an obvious extension of the rule for third-order determinants 
and it remains to <jvaluate each of the determinants shown on the 
right-hand side and collect terms. It is then found that the deter- 
minant denotes the algebraic sum of 24 terms, each term consisting 
of four of the total of sixteen elements so chosen that one comes from 
*each row and one from each column. Again, half the terThs are 
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positive and half negative in the same kind of way as before. The 
determinant notation represents more and more lengthy expressions, 
and so becomes more and more convenient, as we j)roceed. 

In general, a determinant of the nth order is written down in terms 
of determinants of the (n - l)th order by the rule : 

^3 ••• ®in ^22 ^23 ••• ®2n ^^12 ®21 ^23 ••• ^2n 

®21 ®22 ®23 ••• ® 2 n 

®n2 ^n3 ••• ®nn ®nl ^nS ••• ®nn 

®nl ®n2 ®n8 ••• ®nn 

4-.-. . 

The signs of the successive entries on the right-hand side are alter- 
natively positive and negative until all the n elements of the first 
row are taken. The {n- l)th order determinant multiplying is 
obtained from the original determinant by the omission of the first 
row and rth column. The determinant is now to be evaluated by 
going, step by step, back to determinants of the second order and 
BO to the elements themselves. It is then foimd that we have the 
algebraic sum of terms each of which is the product of n of the n* 
elements selected so that one element comes from each row and one 
frc'm each column. Half the terms have a positive and half a 
negative sign. The number of terms in the sum is 

|j^ = n(n - l)(n - 2) ... 3.2 . 1, 

which is very large when the order of the determinant is high. 

The determinant notation as written above can be sometimes 
abbreviated to 

I a,,, I (r and 5 = 1, 2, 3. ... w), 

tt 

where a,,, is a typical element (in the rth row and sth column). 

A determinant, therefore, is a notation expressing aij algebraic 
sum of terms which are products of the elements of the determinant 
and its value is of the same nature (e.g. numerical or a function of 
certain variables) as the elements which composQ. it. The practical 
way of finding the value of a given determinant is indicated by 
the definition, i.e. by successive reduction to determinants of lower 
lower order until second-order determinants are obtained and 
evaluafed by cross multiplication. 
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18.3 Properties of determinants. 

The following are simple properties of the determinant notation : 

(1) A determinant is imchanged in value if its rows and columns 
are transposed : 

Oji 0^2 ®13 ••• ~ ®11 ®21 ^31 ••• ^nl 

^21 ^22 ®23 ••• ^ 2n ^12 ^22 ®32 ••• ® n2 

®nl ®n2 ®n8 ••• ^nn ^In ^2n ®3n ••• ®nn 

(2) The value of a determinant is unaltered numerically but 
changed in sign if two rows (or two columns) are interchanged : 

e.g. I (li 2 CLxL ‘®13 ••• ^In = “ ®11 ®12 ^3 ••• 

^22 ®21 ^23 ••• ^ 2 n ^21 ®22 ®23 ••• ® 2 n 

^ti2 ^nl ^n3 ••• ®nn ®nl ®n2 ®n3 ••• ®nn 

(3) The value of a determinant is increased A;-fold when each 
element in one row (or column) is increased in this way : 

e«g* IcCt^-y lcd-^2 ^^13 ••• ~~ k ®12 ®13 ••• 

®21 ®22 ^23 ••• ® 2 n ^21 ^22 ®23 ••• ^ 2 n • 

^n2 ®n3 ®nn ®nl ®n2 ••• ^nn 

(4) A determinant has zero value if the elements of one row (or 
of one column) are equal or proportional to the corresponding 
elements of a second row (or of a second column). 

(5) If the elements of one row (or of one column) of a determinant 
appear each as the sum of two parts, then an additive rule appUes : 

••• ®ln+^n^ = ®12 ••• 

^ 0/2X ®22 ••• ®2n ^21 ®22 ••• ®2n 

Onl ^2n ••• ^nn ®nl ®n2 ••• ®r»n 

/ / _ / 

®11 ®12 ••• ^In 

^21 ^22 ••• ®2n 

^nl ®n2 ••• ^nn 

(6) The value of a determinant is unchanged when a multiple of 
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the elements of one row (or of one column) is added to the corre* 
sponding ebments of a second row (or of a second colunm) : 


e.g. 

Ctn -f k(l2i 

(I 22 "h k(l22 

... ~f" k(t2fi 

r= 

Ou 

... 

Oln 


0,1 

U 22 

... a2n 


(121 

^22 ... 

®2n 


am 

U„2 

... 

... ^nn 


®nl 

Ct'ni ••• 

®nn 


To make clear the meaning of these properties, it is a useful 
exercise to verify that they hold in the particular cases of second 
and third-order determinants. It is also a relatively easy matter 
to give general proofs, using properties of the values and signs of the 
terms in the expanded form of a determinant (18.2 above).* The 
practical evaluation of determinants is often simplified by using the 
above properties, e.g. a combination of ( 6 ) and (4) sometimes shows 
that a determinant has zero value. The following examples illustrate : 
Ex. 1 . Show that 1 2 3=0. 

4 6 6 

7 8 9 

Take the elements of the first row from those of the second and third 
rows, using property ( 6 ) with -1. Then 

1 23 = 1 23=0 by property (4). 

4 6 6 3 3 3 

7 8 9 6 6 6 

The same result holds if the elements of the determinant (reading by 
roWs from left to right) form any set of integers in Arithmetic Progression. 
Ex. 2. Express 1 a in factorial form. 

1 6 62 

1 C C2 

If a = 6 , the first two rows of the determinant become identical and it has 
zero value. Hence, (a - 6 ) must be a factor of the value of the determinant. 
Similarly, (6 - c) and (c - a) are factors. Since the value of the determinant 
is of the third degree in a, 6 and c, it must be a numerical multiple of these 
factors. The leading term in the expanded value is 1 x 6 x 0 ^ (from the 
“ diagonal ” of the determinant) and the multiple can only be unity. The 
value of the determinant is thus (a ~ 6 ) (6 - c) (c - a). 

18.4 Minors and co-factors of determinants. • • 

Selecting an element of a given determinant, we delete the row 
and column intersecting in the element and obtain a determinant of 

4 See Netto, op, ciL, pp. 13-9, and Courant, Differential and Integral Cal- 
oulust Vol. II (English Ed. 1936), pp. 20-3. 
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order one less than that of the original determinant. The derived 
determinant is called the minor of the selected element in the given 
determinant. There are minors of a determinant of the nth order 
and each is a determinant of the (n - l)th order. In 
^ =1 I 3 = 1, 2, 3, ... n) 

the minor of the typical element is denoted by as obtained 
from A by omitting the rth row and 5th column. 

The co-factor of a selected element in a given determinant is the 
minor of the element with a sign attached. The rule of signs is 
quite simple. If the numbers of the row and column containing the 
element add to an even number, then a pins sign is given to the co- 
factor ; if they add to an odd number, then a minus sign is allotted. 
The co-factor of a,., in J is denoted by where 
Ars=Arg if (r-f5) is even 
= -A^^ if (r-f 5 ) is odd. 

It is important to distinguish between the closely related concepts 
of minors and co -factors. The minors of various elements in 


fltii 0^2 ^13 

®21 ®22 ^23 

^31 ®32 ^33 


are 

II 

®22 

®23 

> ^12 — 

®21 

^23 

5 ^21 — 

012 ^3 



®32 

^33 


^31 

®33 


^32 ^^33 


On the other hand, the co-factors of the elements are 


All — 

^22 

^23 

; Ai2— - 

a 21 

^23 

; ^ 21 — ~ 

(^12 

^3 


^32 

^33 


^31 

^33 


^32 

^33 


Our definition of determinants can be re-framed in terms of 
mindrs or co-factors. We have 

A = I a^g I =0^1 All ~ ^2 ^12 ^13 ^13 ~ “1" ( ■” 1 

or 12 "t" ^3-^13 "t” • • • "t* ^In ^ln> 

i.e. the value of A equals the sum of the elements of the first row 
each multiplied by the corresponding co-factor. A similar result 
can be shown, by property (2) of 18.3, to hold for the elements of 
any row. For example, taking the second row, we have 

^21*^ 21 "b ^22^^ 22 ^23^23 "!“••• "i"fl^2n'^2n ^ 

~ — (^21^21 ^22^22 "i” ®23^23 • • • i ^2n^2n)* 
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The bracket on the right-hand side is, by the definition, the value 
of the determinant obtained by interchanging the first two rows of J 
The bracket thus equals and so 

^ ~ ^21-^21 22 ”f“®23'^23 “i" ••• '^^2n^2n’ 

Again, take the sum of the elements of one row of the determinant 
each multiplied, not by the corresponding co-factors, but by the 
co-factors of the elements of another row, e.g. 

^21-^U +^22^12+^28-^13 + ••• +®2n^ln* 

This sum must, by the definition, equal J except that the first row 
in J is replaced by the row of elements (agi, a 22 > ® 23 i ••• ® 2 n)> 
sum represents a determinant with the first two rows consisting of 
identical elements. By property (4) of 18.3, the sum is zero. 

The following results are now established : 

The sum of the elements of any row of A = \a^^ \ each multi- 
plied by the corresponding co-factor is equal to the value of the 
determinant. The sum of the elements each multiplied by the 
co-factor of the corresponding element of another row is zero. 
In symbols, for xmequal values of r and 

(Xfi (lf2 ^ r 2 + ^r3 -<4 ,.3 4“ . . . + d^n A m ~ ^ 1 

driAgi-\~ar2^»2'^^rZ^9Z + ••• +®rnA,„=0 / 

The first result, an extension of the definition, is often called the 
expansion rule and its use in evaluating determinants is evident. By 
property (1) of 18.3, the results are true also for the elements of any 
column of the determinant. 

Minors of higher order than those discussed above can also be 
defined. Selecting two elements not in one row or column and 
omitting the two rows and two columns containing them, a second- 
order minor is derived as a determinant of order two less than that 
of the original determinant. The process can be continued by 
selecting more and more elements and omitting ,tl}e relevant rows 
and columns. In particular, the principal minors of J = | a,., | are 
the minors of various orders obtained by selecting the last, the last 
fmo, ... elements of the ‘‘ principal diagonal ” (Oii, a 22 , a^^, ... a„„) 
an^ then carrying out the deletion process indicated. 
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18.5 Linear and homogeneous functions of several variables. 

The determinant notation serves to express precisely the pro- 
perties of linear and homogeneous functions of more than two 
variables. Let y—J(x^y x^, ... x^) be a linear and homogeneous 
function with continuous partial derivatives of the first two orders : 

(»‘and 5 = l, 2, 3, ...n). 

Then, Euler’s Theorem (12.8 above) can be extended to give : 


^fi+X2f»+^sfa + — +X„f„ = y ( 1 ) 

at any point (Xi, X2y Xg, ... x„). Since the result is identically true, it 
can be differentiated with respect to any variable Xr : 

fr '^^2f2r ••• 'I’^n/rn ~ fry 

be- ^flr-^^2f2r-^^zSzT^—-^^nfrr,=^y (2) 

(r = 1, 2, 3, ... n). 


The n relations (2) are extensions of the results (4) of 12.8 and give 
each direct second-order partial derivative in terms of the cross- 
partial derivatives. 

We denote by F the determinant of the (n + l)th order formed 
from the block of second-order partial derivatives of/, “ bordered ” * 
with the first-order partial derivatives : 

F= 0 fi fg fz ••• fn 

fl fll fl2 /l3 fin 

fz fl2 fzz fzz ••• fzn 


\ Jn Jin J 2n J 3n ••• J nn 

Let Fc denote the co-factor of the element 0 in i'’ and co- 

factor of the element in Fg (r and « = 1, 2, 3, ... n). Then 


(3) 

and . (4) 


The proofs of (3) and (4) are interesting exercises in the manipu- 
lation of determinants : 

Multiply the relation (2) by i^oir(^ = 2, 3, ... n) and add together 

the n equations obtained. Then 

^(/ii ^ ou +fl2^ +/ln^ Oln) ^ 

+ Xt{fr +/,„J'’oi«) + ...=0. 
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By the results of 18.4, the coefficient of Xi is Fq and the coefficients 
of ajj, ... Zn are all zero. Hence, 

0^1 = 0 

for any value of z^, i.e. This proves (3). 

The determinant F can be evaluated as follows i 






aii/i 

fi 

Jl 

xJn 

fli 

Ji 

^ifli 

fa 

Js 

^ifis 

fiS 

Jn 

••• ^1 fin 

• • • fin 

fn 

fin 

fin 

fsn 

• • • fnn 

0 

fi 

A 

h 

... fn 

y 

0 

0 

0 

... 0 

/a 

fli 

fa 

fiS 

• * • fin 

fn 

fin 

fin 

fsn 

••• fnn 


by 18.3, (3) 


by 18.3, (6) 


adding z^ times the elements of the third row, z^ times the elements 
of the fourth row, ... to the elements of the second row and using 
(1) and (2) above. Expanding in terms of the elements of the second 
row and applying a similar manipulative process aU over again (with 
columns instead of rows), we find 


.1 


/i 

/l2 


/2 

/22 


/a 

/23 


fin fin fan 
= 


fn 

fin 


h 


y 


^ifi 

^l/l2 


fi 

fa 


fs 

fiS 


fn 

fin 


^'ifln fin fzn ••• fn\ 


y fi fs 

fi fa fis 


fn 

fin 


0 fin fi 


3n 


U 


if IP 

- ~ ^ Oil- 


Hence, F^i^ = - ^ F, which is one of the results (4). The /)ther 
results follow by similar reductions of F, 


18.6 The solution of linear equations. 

The determinant notation is of particular use in solving lineal 
equations, operating with quadratic forms and making linear sub- 
stitutions. We propose to give here some account of the use of the 
notation in the first two problems.* 

f For^ applications of the determinant rotation to linear substitutions, see 
Netto, op, cit,f pp. 77 et seq, and pp. 100 et aeq, * 
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As we have seen (11.6 above), a system of exactly n equations 
in n variables is sufficient to determine the values of the variables 
provided that the equations are consistent and independent. But 
this tells us nothing about the actual values obtained in the solution, 
or even whether one or more alternative solutions exist. These points 
can be cleared up at least in the simple case where all the equations 
are of linear form. The solution of the pair of hnear equations 

+ and a^. 


in two variables is (by the method of 2.9 above) given by 


and 


~~~ 0/2^0^ 
- a^bi 


Using the determinant notation, the solution can be written 

^ _ y 1 


Cl 



ttl 

Cl 


«1 


C* 

62 


0-2 

C 2 



b. 


The solution is unique and appears in terms of second-order deter- 
minants involving the coefficients of the equations. 

To generalise the result, consider a system of n hnear equations 
in n variables, Xg, x^, ... x^ : 


^12^2 "i" ^13^3 "H • • • “h ®ln^n — 
®21^1 “i" ^22^2 +^23^3 + ••• + ®2n^n “^2 


®nl^l "I" ®n2^2 "i" ®n3^3 'I' • • • + ^nn^n “ ®n' 

The given coefficients in the equations form a square block of ele- 
ments giving the determinant J = | a,., | (r and 5 = 1, 2, ... n). Here, 
a,., is the coefficient of the ^th variable (a;,) in the rth equation. 
The constant terms (a^, a 2 , ... «„) on the right-hand sides of the 
equations are kept separate. Assuming that J 0, the equations are 
solved by the following device. Multiply the equations respectively 
by ^ 21 , ... the co-factors of the elements of the first column 
of Ay and add. Then 

-f 021-421 4" ... + (®12^11 "h ®22-^21 + . . . +Cfrj2^nl)^a 

-f ... -f (ai,j.4ii 4-02„-42 i 4 - ... 4-a„„.4„i)a:,j = ai.4ii 4 - 02^^21 4- ... 4-o^n^ni* 

By the results of 18.3, the coefficient of ;Ci here is A and t]^e 
coefficients of the other variables are aU zero. Further, let Jj 
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denote the value of A when the first column of elements is replaced 
by the column (Oi, Ug, ... a„), so that 

The equation then becomes Ax^^^A^y i.e. 



Similar processes give the values of the other variables and so : 


The solution of the n linear equations written above is unique 
and can be expressed in the form 

where A is the nth order determinant | a^, | and A^y A 2 , As, A^ are 
the determinants obtained by substituting the column (Oj, Uj, ... a„) 
for the first, second, third, ... and nth columns of A respectively. 


The result for two variables given above can be verified as a 
particular case of this general solution.* 


Ex. 1, The equations 


3x + y-62=0, -a; + 3y +42=5, x + y + 22=:4 


give 

X y 


0 1 -6 


3 0-6 

5 3 4 


-15 4 

4 1 2 


1 4 2 


48 36 


z 1 


3 1 0 


3 1 -6 

-13 5 


-13 4 

1 1 4 


1 1 2 


A_JL 

^“36’ 


i.e. 


y==L 2=i 


Ex. 2. In considering the variation of demand for factors of production 
(14.8 above), we had to solve the equations ^ 


_ bpi, xp 

rfU-i-paV+PiyW=0, cru -—V+p^W=—a, 

® Pa 

for the three variables 


(tt) 

ou + p^v - m ; « 0 


u 


da 


dp _ 


and w = 




♦ Wo can now see, at least roughly, why J = 0 is a case of failure of our 
result. If id = 0 and J,, J,, J,, ... only infinite values of the variables 

can satisfy the equations, and the latter are thus inconsistent for all finite 
values of the variables. If some of J,, J,, . . . are also zero, then at least 
due vaiiable becomes indeterminate (zero divided by zero) and the equationsi 
are not independent. 
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V' 




0 

Pa 

Pb 


V 

0 

Pb 


xp 

<7 

Va 

a 

Pb 


a 

xp 

— <7 

Pa 

Pb 



0 

Pa 

aPa 

b 


a 

0 

apa 

b 




W 




1 




V 

Pa 

0 


V 

Pa 

Pb 





xp 


QT 

_bn 

Pb 




a 

Pa"" 



a 



a 

Pa 

0 


G 

Pa 

(^Pa 

b 

three determinants are 

easily expanded 

since 


each case contains two zeros. The other determinant, expanding in terms 
of the first column, equals 

V (PaPi, -PaPl,) - ^ -PaP^ + {paPb +^) 




since x'p = ap^ + hp^. Hence, 

u V 


w 


1 


xp 

T' 


xp 


i.e. 


and 


<j{apaV + f>p^a) 

1 

da 


Wa~ 

Pa\xp 

i* _ 

ab . 

SPa 

xp 


-xpG{rj-a) 2 , 

+ 5 &„) 

xp J 


18.7 Quadratic forms in two and three variables. 

An expression which is the sum of a number of terms each of 
the same (given) degree in certain variables can be termed a, form. 
Classifjdng forms according to the degree of the terms, we distinguish 
linear forms, quadratic forms, cubic forms, and so on. The following 
analysis is concerned with forms no more complicated than the 
quadratic and makes their nature clearer than any general remarks 
we can offer. 

The general expression of a linear form is (ax + by), (ax + + cz), . 

*when there are two, three, ... variables. Here a, b,c, are constant 
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coeflBcients. It is clear that linear forms present no algebraic dilB- 
culties and no special theory need be developed. 

The general quadratic form is 

{ax^ + by^ + 2hxy), {ax^ + by^ -f cz^ 4- 2fyz + 2gxz + 2hxy), ... 
according to the number of variables. There is now more algebraic 
difficulty in dealing with such forms and the main problem that 
presents itself is to determine whether a given form has the same 
sign (i.e. positive always or negative always) for values of the 
variables which are not aU zero. The solution of many problems in 
mathematical analysis depends on the exact conditions under which 
this property holds. 

With tw’o variables, we can write the quadratic form 

F(Xy y)=ax^ + by^ + 2hxy=a [xi-- y ) 4- yS 

adopting a process (familiar in elementary algebra) known as “ com- 
pleting the square ’’ in the variable x. It follows that F(x, y) is only 
positive for all values of x and y not both zero, if the two squares in 
the above expression have positive coefficients, i.e. if a and (ab-h^) 
are both positive. Similarly, F{Xy y) is negative under the same 
conditions only if the coefficients are negative, i.e. only if a is nega- 
ti'^e and {ab - h^) positive. These conditions are easily seen to be 
both necessary and sufficient. Writing the expression {ab - h^) in 
determinant form, we have the result : 

The quadratic form (ax^ + by^ + 2hxy) is positive for all values 
of z and y (other than a: = y = 0) if, and only if, 
a>0 and a h >0 
h b 

and negative for all values of x and y (other than a: = y=*0) if, 
and only if, the same expressions are negative and positive 
respectively. o 

It may be objected that the condition written is unsymmetrical 
in the sense that the coefficient of x^, rather than that of j/*, is 
selected for determination of sign. In fact, if we complete the 
square in y instead of in x, we find that F (a;, y) is always positive if 
^ and {ab - li^) are both positive and always negative if 6 is negative 
arid {ab - h}) positive. The essential point, however, is that these * 
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second conditions are equivalent to the first since {ab — is only 
positive if ab is positive, i.e. if a and b have the same sign. Our con- 
dition is thus quite symmetrical. It may be objected also that the 
determinant notation is not necessary here. This is true but the 
notation is introduced to facilitate the generalisation of the result 
to cases of more than two variables. It is to be noticed that the 
determinant used is most easily written from the coefficients of the 
quadratic form when arranged : 

y)~ax^ ^-hxy 
-I- hxy -f- by‘^. 

The determinant is called the discriminant of the quadratic form. 


In the case of three variables, we write 
F (Xy y, z) = ax^ + by^ -h cz^ -f 2fyz + 2gxz + 2hxy 

= a \x-\-~y 

\ a a ) 


2 afe-A2 „ ^af-gh 


+ - 


a 


r+- 


^2 + 2 - 




( h g ab-K^ [ af -gh Y 


^ ahc-ap-bg‘^-ch^ + 2fgh , 


where we have completed the square ” twice in succession. Now, • 
F (x, y, z) is positive for any values of the variables not all zero 
if, and only if, the coefficients of the three squares in the above 
expression are all positive, i.e. if * 

a, (ab-h^) and {abc-aj^ -bg^ -ch^ + 2fgh) 
are positive. Similarly, F(x, y, z) is negative for any values of the 
variables not all zero if, and only if, the same coefficients are aU 
negative, i.e. if the three expressions written above are negative, 
positive and negative respectively. The determinant notation is 
again applicable and we have the result : 

The quadratic form {ax^ + by^-^cz^ + 2fyz -f 2gxz + 2Aa:y) is positive 
for all values of x, y and z (other than x = y = z=^0) if, and only if. 


a h 

>0 and 

a 

h 

9 

h b 


h 

b 

f 



9 

f 

c 


and negative for all values of x, y and z (other than x ^y — z — 0) if, 
and only if, these expressions are negative, positive and negative 
respectively. * * 
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It can be shown, as before, that the conditions set out here are 
symmetrical and can be written in two equivalent ways by selecting 
the coefficient 6 or c instead of a. The third-order determinant 
written is called the discriminant of the quadratic form and is most 
easily derived when the form is arranged : 

F{x, y, z)—ax^ + hxy -\-gocz 
-\-hxy-\-by^ +fyz 
+gxz-{-fyz+cz^. 

A more restricted problem of the same nature is that of finding 
the conditions imder which a quadratic form preserves a given sign, 
not for all values of the variables but for values satisfying a given 
linear relation in the variables. The general method of solving this 
problem is quite simple. The linear relation gives one variable in 
terms of the others and the quadratic form is then reduced to a form 
in one less than the original number of variables. The previous 
results apply at once. Suppose that the two variables of the quad- 
ratic form {ax'^-\-by^-{-21ixy) are related aa:+j8i/ = 0, where a and j8 

are constants as well as a, b and c. Then y = - ^ a: and the quadratic 
form becomes : ^ 

ax^ + 6 “ - 2A ^ = {bet} -f - 2Aaj3) ^ , 

' P P P 

which is positive (negative) for all values of x (and for the corre- 
sponding values of y given by the relation) if, and only if, the ex- 
pression {bo} -{■ ap^ - 2h(xP) is positive (negative) From an example 
given above (18.2), the condition can be written in determinant 
form : 

The quadratic form (ax^ by^ -{- 2hxy) is positive for all values of x 
and y (other than a;=y = 0) which satisfy aa;-f-j8y = 0 if, and only if, 

0 (X p <0 

a a h * 

p h b 

and negative under the same conditions if, and only if, the deter- 
minant is positive. * « 

Notice that the third-order determinant is obtained from the dis- 
criminant of the original quadratic form by ‘‘ bordering ’’ with the 
coefficients a and p from the linear relation. 
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In the three variables case, the variables in the quadratic form 
{ax'* + by^-‘rCz^ + 2fyz-\-2gxz + 2hxy) are related ax+^y +yz=iO. Eli- 
minating x= --{)8y+yz), the form becomes 

(X 

^ + yz)* + by* + cz* + 2fyz -—(^y + yz)z- — {Py + yz)y 
oc oc oc 

= Ay^ + Bz^ -f 2Hyz, 

where ^ ^ “ 2Aaj8), 5 = (coc^ + ay^ - 2g(xy) 

and H = — - hocy -f /a^) . 

The form is positive for all values of y and z (x being given by the 
linear relation) if ^ > 0 and AB ~ H^>0, i.e. if 

b(x^ + — 2hoc^ > 0 

and {be -f^) + {ab - g^) -f {ab - h^) y^ 

- 2 (a/ - gh)py - 2 {bg -fh) ay - 2 (ch -fg) a/3 > 0. 

The form is negative under the same conditions if the two expres- 
sions given above are negative and positive respectively. It is 
easily verified that the expressions are simple determinants and so : 

• 

The quadratic form {ax^ + by^ + cz^ + 2fyz 4- 2gxz + 2hxy) is positive 
for all values of a;, y and z (other than z = y = z — 0) which satisfy 
ocx-\-Py -hyz = 0 if, and only if, 


0 

a 


< 0 and 0 

a 


y 

a 

a 

h 

a 

a 

h 


P 

h 

b 


h 

b 

f 




y 

9 

/ 

c 


and ilegative under the same conditions if, and only if, the two 
determinants are positive and negative respectively. 


18.8 Examples of quadratic forms. 

The following examples illustrate our results : 

Ex. 1. (x^ + 2y%-^z^ +yz -{-2xy) is positive for all values of a;, y and z 

which are not all zero ; for 


1 > 0 , 


1 1 =»1>0 and 

1 

1 

0 

1 2 

1 

2 

i 


0 

1 

1 


|> 0 . 
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Ex. 2. - -k-xy) is negative for aU values of x, y and z 

subject to the relation + y +22 = 0 since 

-KO, 


0 

1 

1 

= 1>0 and 

0 

1 

1 

2 

1 

1 

\ 


1 

1 

i 

0 

1 

i 

~1 


1 

i 

-1 

0 





2 

0 

0 

-7 


Or, eliminate 2 =» •“ i (^ + y) a-nd write the quadratic form 
a;2 - J(a; + t/)* +xy = - J(3x* + \0xy + 

But (3x^ + lOxy + lit/*) is positive for all values of x and y since 


3>0 and 


3 5 

6 11 


= 8 > 0 . 


The original quadratic form is thus negative for all values of x and y 
and for 2 = -\{x-\-y). 

Ex. 3. i/i, t/2 Vz given functions of three variables Xj, X2 and Xj 

such that 

and 

9x2 dx^ * 9 x 3 9x2 9xi 9 x 3 

It is required to find the conditions under which 

* Ix^ ^ Ix^ ^ Ix^ 

• + dxi dx2 + dxg* + rfxg dx, 

OXj^ 0X3 

ali ^ 

is negative for all values of dx^, dxa and dx^* The conditions are 


dxi 


<0 


9xi 9x2 

^3 ^ 

9xi 9x2 


>0 and 


%i 

9xi 9^2 9 x 3 

^ ^ ^2 

9xi 9x2 9 x 3 

^ ^ ^ 

dxi 9x2 9x3 


< 0 , 


i.e. the Jacobians 


W’ xi negative, positive 

t/Xj vv^l> •*'2/ ^2> ^8/ 


and negative respectively. 

# See Jj^amsey, A Contribution to the Theory of Taxation, Economic Journal,* 
1927, p. 60. 
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18.9 Two general results for quadratic forms. 

The problems considered above can now be generalised. The 
general quadratic form in n variables 0 : 3 , ... is 

F{x^, X2y ... xJ=ai^Xi^-i-a22X2^-h... 

•f 2a^2^j^X2 4- 2ai2^iX2 4 - . . . + 2a2^^z + • • • » 
which can be written more symmetrically as 

F ( 3 *^, 2^2, X^y . . . Xy^ = 4- . . . “j“ 

4" 021^1^2 ”1“ ^22^2^ 4“ ... 4" ®2n^2^n 

4- 

+ + «n 2 ^ 2 ^n 4- ... 4- 

where the a’s denote given coefficients such that the order of the 
suffixes is immaterial {a 2 i=ai 2 y •..). The determinant J = |a,., l, 
written down from coefficients of the quadratic form, is called the 
discriminant of the form. We also need the “ bordered determi- 
nant 


= 

0 

«! 

oca ... 

an 


«! 

flu 

®12 ••• 



aa 

^21 

0^22 ••• 

^2n 


an 

®nl 

®n2 ••• 

®nn 


where a^, aa> ••• «n ^re some constants. • 

The quadratic form F {x^y X 2 y x^y . . . x^) is said to be positive 
definite if it takes only positive values for all permissible values of 
the variables which are not all zero. Similarly, the form is negative 
definite if it takes only negative values. Two general results can 
now be stated : 


( 1 ) The quadratic form is positive definite if, and only if, 
A and all its principal minors are positive, i.e. 


au> 0 , 


Oil 

^12 

>0, ... 

Ou 

Oja 

“in 

®21 



o»i 

Uaa 

a>2n 

# • 



®nl 

Cl„2 

^nn 


>0 


and the form is negative definite if, and only if, the above 
expressions are alternatively negative and positive. 

( 2 ) The quadratic form is positive definite subject to the 
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relation aia;i4-a2a:2 + ... = 0 between the variables if, and 
only if. A' and all its principal minors are negative, i.e. 


0 


aa 

< 0 , ... 

0 


0^2 

•• a„ 


<hi 

®ia 


“i 

Oil 

012 

.. 

^2 

®21 

Uaa 


0 C 2 

C 121 

022 •• 

.. Ojn 





an 

^nl 

0^2 •- 

•• ^nn 


and the form is negative definite under the same conditions if, 
and only if, the above expressions are alternatively positive and 
negative. 

The proofs of these general results follow lines similar to those 
indicated in the particular cases already established and no further 
details need be given here. The proof of (1) depends on the 
reduction of the quadratic form to the sum of n squares, all the 
coefficients of which must be positive (or all negative). The proof 
of (2) is obtained when the quadratic form is reduced from one in 
w variables to one in (n - I) variables by means of the given relation. 

EXAMPLES XVin 
Determinants 


1 . 

Show that 

1 

1 

1 

= 

1 and I4-0 1 

1 1 = 060 4 - 06 4 - CO 4 - 6c, 



1 

2 

3 


1 14-6 

1 1 



1 

3 

6 


1 1 

1 + c 1 

2 . 

Evaluate 

0 

a 

P 

Y 





a 

a 

h 

9 





P 

h 

b 

f 





Y 

g 

f 

c 




8. Use property (6) of 18.3 to show that 


a 

b-a 

c + b =0 

6 

c 

and 

0 

1 

1 

= 0 

1 

1 

6 

c^b 

a4-c 6 

c 

a 


1 

a -f 6 

6 

1 

a 

0 

c 

a-c 

64-0 c 

a 

6 


1 

a 

04-6 

1 

0 

’6 


Hence evaluate the determinants. 

4 . Show that 6 + 0 a a = 4 abc ♦ 

b c + a b 

c c 04-6 

by noticing that the determinant vanishes if (e.g.) o = 0. 

5 . Show that I a? o o =(a?-o)* 1 0 a ( 18 . 3 , ( 3 ) and (6)). 

j o a: a 0 l o 

; , I a a X — 1 - 1 a? 

gOedu^e that the determinant equated to zero gives a cubic equation in x ^ 
with two roots a; = o and a third root «= - 2a. 
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6. Prove that a 

0* 

a* - 

-1 

= (a6c-l) 1 

a 

a* ( 18 . 3 , ( 2 ) and (6)), 

b 

6* 

68- 

- 1 

1 

b 

6» 

c 

c* 

c»- 

- 1 

1 

c 

c* 


and hence evaluate the determinant. 


7 , Factorise 

1 

«i 

a,» .. 

.. aj"-* 


1 


a,* .. 

.. a,"-* 


1 





8. Write down the co-factors of the elements of the second and third rows 
of the second determinant of Example 1 above and verify the results of 18.4 
in this case : 

(l + a)A„ + A„ + = 0 and (1 4-a)A3j -f A,, + A3, = 0. 

9 . Show that u — (where a + jS + y=l)isa linear and homogeneous 

function for which 

du dhl __oc.pu 

dx~ X* ”* * dx* X* * *** * dxdy xy * 

Verify the determinant properties of 18 . 5 . 

Linear equations 

10 . Solve 2x-t/ = 2, 3 i/ + 22 = 16 and 3 z + 6x = 21. 

11 . If a, 6 and c are given constants, find the solution of 

-x + y + z = a, x-t/ + 2 = d and x -h t/ - z = c. 

12 . Use the determinant of 18 . 3 , Ex. 2 , to solve the equations 

x + 2/4-z=l, ax + hy^cz — d, -\-cH = d\ 

where a, h and c are constants. 

13 . From the result of 18.6 verify that the equations 

aia; + 6it/ + CiZ + d, = 0, 

• o,x + 632/ + c,z + d, = 0, 

have a unique solution which can be written 


X y z \ 



where J,, J, and are the co-factors of A^, A,, A, and A4 in 
' ' A, A, A. A4 

Ui C|^ d-j^ 

a, 63 c, dj 

a^ b^ C3 d^ . 

Generalise this form of the solution of linear equations. 
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14, Show that the solution of the three equations of the previous example 
satisfies a fourth equation a^x + h^y + C 4 Z + (^4 = 0 provided that 

Uj by Cj dj = 0 . 

Oj 6 , Cj d, 

a. 6 , c, d, 

O 4 2)4 C 4 

This is the condition that four linear equations in three variables are con- 
sistent. Generalise it. 

Quadratic forms 

16. Show that (4xy - 2x* - 3i/*) is negative definite and (re* + y* -I- 2 * - yz) 
positive definite for all values of the variables (not all zero). 

16. If X and y take any values subject to 2a? + y = 0 , show that (a;* + y • + 3xy) 
is negative definite. Is this quadratic form also negative definite for any 
values of the variables? 

17. Show that (xy yz + xz - x^ - y* - z*) is negative definite for values ol 
the variables which are such that a? + y + z = 0 . 

18* If Of bt h, OL and have positive values, show that the quadratic form 
(oa?* + 4 - 2hxy) is always positive definite subject to oa: - jSy = 0 . 

19. u=:f{Xt y, z) is a function of three independent variables. Obtain the 
second differential dht and find the conditions that it is positive definite for 
values of da?, dy and dz, (a) without restriction and ( 6 ) subject to 

du da? -f/y dy +/, dz = 0 . 

20. Write the condition that (oa?* + 6 t/* + 2/w:t/) is positive definite subject 
to.aa; 4 - /9y = 0 . Show that the quadratic form is positive definite for any values 
of a and p under conditions identical with those required for the form to 
be positive definite for unrestricted variation of x and y. Generalise and 
indicate the relation of the conditions for relative definite quadratic forms 
to those for unrestricted definite quadratic forms. 



CHAPTER XIX 


FURTHER PROBLEMS OP MAXIMUM AND MINIMUM 

VALUES 


19.1 Maximum and minimum values of a function of several variables. 
The definition of extreme values of a function of any number of 
variables can be put most precisely in the form (see 17.6 above) : 

The function y=f(x^, X 2 , ... xj of n independent variables has 
a maximum (minimum) value at Xi=ai, X 2 = a 2 y ... if 

+ + ••• + « 2 » ^ negative (positive) 

for all small values of a?!, arg, ... x^. 


The analysis which follows is set out, for convenience, in terms of 
functions of two variables. It is, however, quite general in form. 

It is assumed that the function z=f{x, y) has continuous partial 
derivatives up to any desired order at the point (a, 6). Then Taylor’s 
Theorem gives, for any integral value of w, * 


f{a + x,b+y) -f{a, b)=df + ^d^f+^d^f+...+^d”f+B„{x, y), 


where the successive diflFerentials of the function are taken at the 
pomt .(a, 6) and with arbitrary increments dx=x and dy = y, and 
where 

Rn {x, y) = ■ ^ point (a -h 0a;, 6 + By), 

\7l -{• l 

As before, d denotes some positive fraction. 


(1) Suppose that df ^0 for some variations in the variables from 
the point (a, 6). JThen by the continuity of the partial derivatives 
of the function, dfj^O also for some variations from the point 
(a 4- 0a;, 6 -f 0y), provided only that x and y are sufficiently small. 
Bz Bz 

further, if df=x^+y:^ happens to be positive for certain Faluos 
dx By 
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of X and y, then it must be negative for values of x and y which are 
numerically equal but opposite in sign. Hence, df must take both 
positive and negative values for different variations from the point 
(a + 0a;, 6 -f 6y). Taylor’s Theorem {n = 0) gives 

/(a + a;, b-hy) -f(a, b) =df at (a -f 0a;, 6 + 6y), 
i.e. /(a 4- a;, 6 + f/) -/(a, 6) is sometimes positive and sometimes 
negative according to the values allotted to x and y. It follows, from 
our definition, that /(a, 6) cannot be an extreme (maximum or 
minimum) value of the function. If 2 =/(a;, y) has an extreme value 
at (a, 6), then df must be zero for all variations from (a, 6). 


(2) Suppose that df=0 for all variations while d^^O for some 
variations in the variables from the point (a, 6). By continuity again, 
we can take d^f^Q also for some variations from (a-f 0a;, 6 + 0^), 
provided that x and y are sufficiently small. Taylor’s Theorem, 
with n = I, now gives 


f(a+x,b + y)-f{a,b) = 


at (a + 0a;, 6 + 6y) 


and, using the definition of extreme values, we can distinguish three 
cases ♦ : 


. (a) If dy<0 for all variations from the point (a, b), then z=f(x, y) 
has a maximum value at this point. 

i* (6) If d^f>0 for all variations from the point (a, 6), then z =f{x, y) 
has a minimum value at this point. 

(c) If d^f is positive for some variations and negative for others, 
then z =/ (a;, y) has a stationary, but not an extreme, value at the 
point (a, 6), i.e. the point corresponds to a “ saddle point ” of the 
function. 


(3) If df=d^f=0 for all variations from the point (a, b), it is 
necessary to proceed to the terms in the Taylor series which involve 
the third and higher order differentials. The analysis, though 
possible, becomes complex and, as the cases considered are not 
often met with in practice, we need not examine them. 

Collecting our results and extending them to tho general case of a 
function of any number of variables, we have 

♦ It is possible that d*f < 0 (or d*f > 0) for all variations except some which 
i^ake (f*f—0. This is not included in the three cases here distinguished an^ 
further treatment is necessary to allow for such possibilities. 
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GENERAL CRITERION FOR MAXIMUM AND MINIMUM VALUES 

(1) If j/=/(a^i, ••• ^n) has an extreme value at 

= ... then dy = 0 for all variations of the variables 

from these values. 

(2) If di/ = 0 for all variations and d^y^O for some variations 
of the variables from the values (a^, ag, ... «„)» then y has a 
stationary value at this point which is a maximum value if 
d^y<0 for all variations, a minimum value if d^y>0 for all 
variations and a “ saddle ” value if d^y takes positive and 
negative values for different variations. 


The conditions given are respectively necessary and sufficient for 
extreme values. But the criterion is not complete, i.e. not necessary 
and sufficient, since no allowance is made for cases where d^y = 0 for 
all variations from the values (o^, Ug, ... a„). 

The differential form of the criterion can bo expanded to give 
conditions relating to the partial derivatives of the function. The 
necessary condition (dy = 0) simply implies that all tlie partial 
derivatives of y must vanish at an extreme value (see 14.2 above). 
The sufficient conditions relate to the second differential 


=fx,x, +/x,x Ai dx^ + • . • 

f dx^dx^ ■+■ f dx^ + . . • 

-f ... , 


all the partial derivatives being evaluated at the point (a^, Ug, ... a„). 
A maximum (minimum) value of y is obtained if this quadratic form 
is negative (positive) definite for all variations dx^^ rfxg, ... dx^^ The 
conditions for either case are given in 18.9 above. 

In the case of two independent variables, a ponit where /* =/y = 0 
gives a maximum value of z =f(x, y) if 

d‘‘Z —f XX dx^ “H f XV dx dy 
-vjxydx dy -Vfyydy'^ 

is negative definite; f.e. if 


/«*<0 and 


/a 

hv 


f XV 

fvv 


> 0 . 


^he point gives a minimum value of z if the first inequality is 
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reversed. The second inequality, which is the same in both cases 
of extreme values, is 

fxxfvv^ ifxy)^* 

The conditions are those previously given (14.2 above). 

If there are three independent variables, then a point where 
= 0 gives a maximum value of u=f{x, y, z) if 
dhi dx^ -hf^y dx dy dx dz 

-\-f^y dx dy +fyy dy^ +fy^ dy dz 

■{■f^^dxdz -^fyjydz -\-f,,dz^ 
is negative definite, i.e. if 


/..<o. 


> 0 and 

f XX fxy f xt 


fxy fvv 


f XV fvv fy» 




fxz /v« 


The point gives a minimum value of u if the first and third inequal- 
ities are reversed. 

19.2 Relative maximum and minimum values. 

The advantage of the differential form of the criterion for extreme 
values is that it applies, with suitable restrictions, even when the 
variables are not independent. A function y=f{Xi,X 2 , has 

an extreme value relative to a number of given side relations, 
^1 (^i> • • • ^n) = ff > ^2 (^i> • • • ^n) = ff > • • • > if f b© conditions previously 
written are satisfied, provided that the differentials of y are expressed 
in terms of the independent variables only, the other variables being 
eliminated by the side relations. 

With a single side relation, we seek the extreme values of 
t/=/(ari, Xg, ... relative to <f>{xi, X 2 y ... Xn) = 0. The necessary 

condition is that 

dy =u dxi +f^ dxt +/*, da;, + . . . dx„ = 0 

for all increments which satisfy 

dxi + dx2 + <l>x^ dx^ + . . . + <f>Xf^ dXf^ = 0 . 

Taking x^ as the dependent variable given by the side relation, we 
can eliminate dx^ and obtain 

dy =.(a -U dx, + [u -U I;) + (/*. 
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for all increments in a;,, x„. Hence, 

fxi _ fxt _ /«« _ — An ^ 

4*xt A* ^atn 

These are the necessary conditions (see 14.6 above). To obtain 
sufficient conditions, we write 

=fxx -^fxxxx dx^ -^fxxx, dx^ + ... + 2/,^^ dx^ dx^ + ..., 
where dx^ + dx^ + . . . + dx^ = 0 

and <f>^^ d^x^ + dx^^ + dx^^ + ... -f dx^ dxj 4- ... = 0. 
Eliminating d^x^ by means of the last relation, we have 

= (/*.*, - ^ + (/*.*. - dxi c/jTa + ... 

+ (/»i*i “ ^ ^*1*1^ dx^ dx^ + (/r,*, — dajj* + ... 

-f ..., 

where dx^ + . • . + = 0. 

A point which satisfies the necessary conditions gives a relative 
maximum (minimum) value of y if d'^y is a negative (positive) 
definite quadratic form for all increments dx^y dxg, ... dx^ subject to 
the relation written above. The conditions for either case, given in 
18.9 above, indicate the appropriate signs for the principal minors 
of the determinant formed from the coefficients in “ bordered 
with the partial derivatives ^ar,> ••• 

In the case of two variables, a point satisfying the necessary con- 

f f 

ditions if>{x, j/)=0 and ^ = ^ gives a maximum value of z=f{x^ y) 
9m 9v 

relative to (f>{x, y) = 0 if 

. dH = ^ 4>^)j dx* + (/,, - dxdy 

is negative definite subject to ^*da;H-^ydy = 0, i.e. if 




fmy 

fyv 


<f>y 



(f>yy 
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Expanding the determinant, the condition is 

2 (/., - ^x.) ^x<Av - (/.. - - (/« - ^.*> 0. 

i.e. 


^xifxz^V^ ^fxy^xi^y ^ fyy^x^) fxO^xx^y^ ^^xy^^x^y ^vv^x^) 

This agrees with the result previously obtained (14.6 above) when 

f f 

we remember that ^ ^ by the necessary conditions. 

YX YV 


19.3 Examples of maximum and minimum values. 

The results of the previous two sections are illustrated by the 
following examples : 

Ex. 1. If w= a;* + 2 /^ + 2 ^ is a function of three independent variables, 
then du = 2{x dx +y dy -\-z dz) and d^u = 2 (dx^ + dy^ + dz^) . 

Extreme values of u occur only where du=0 for all variations, i.e. where 
Of =y =2 =0. Since d^u^O for all variations from any point, these values 
must give a minimum value of u. Hence, u has a single minimum value, 
u = 0, which occurs where a; = y = 2 = 0. This is also clear since u is positive 
except when x, y and z are all zero. 

** Ex. 2. If a:, y and z are independent variables and 
u^x-\-2z-\-yz -x^ -y^ -z^f 

then du^{\ - 2x)dx + {z- 2y) dy + (2 -hy - 2z) dz 

and d!^u = - 2 [dx^^ + dy^ + dz^) + 2dy dz. 

Extreme values of u can occur where 

l- 2 a;= 2 -~ 2 y = 2+y~2z=0, 

i.e. x=\, y = \, z=f, giving « = i|. 

But d^u is negative definite at all points since 


2<0, 

-2 0 

«4>0 and 

-2 

0 

0 


0 -2 


0 

-2 

1 




0 

I 

-2 


Hence, u has a single maximum value 

Ex. 3. To find the shortest distance of the origin '^f co-ordinates from 
the plane with equation ax +by -\-cz^d : 

Let u be the square of the distance from the origin to the point with 
co-ordinates (x, y, z) on the plane. Then 

u * X* 4- y® + 2 * where ax 4- 6i/ + cz = d. 
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The relative maximum of u is required. The necessary condition is 
du^2{x dx dy-\-z dz) =0, 
subject to a dx + h dy c dz ^0. 

a b c 


So 


Since ax + by ■^cz=^dy these equations give 

ad bd cd 

*“'a* + 62 +c»’ ^~~a^+b^+c*’ ^^a^+b’^+c* 
and the corresponding value of u is 

d^ 

^ -f 6^ + c* 

Taking the equation of the plane as giving a: as a function of y and z, then 
d^x^O, and 

d^u = 2x d^x + 2 {dx^^ + dy^ + dz^) = 2 {dx^ 4- dy^ + dz^) > 0 
for variations from any point. The positions obtained is thus one of 
relative minimum. The shortest distance required is the square root of 
the minimum value of u, i.e. it is 

d 

N/a^ 4-6^ +c* 


Ex. 4- To find the rectangular block of maximum volume we can cut 
from a sphere of radius a (14.7, Ex. 2), we require the maximum value of 
V *= %xyz relative to x"^ -\-y^ -^-z^ ^ d^. The necessary condition is 
dV —^{yzdx^-xzdy -k-xy dz)^0 subject to 2{x dx -\-y dy -\-zdz)=^0, 

yz xz xy ^ 
x'~ y'' z 

a . . 8a* 


Hence, 

Using a;* 4- y* 4- 2* = a*, we find x=^y^z=^ 


_^3, giving 


Now, d^V — Syz d^x ^\^[z dx dy ^-x dy dz ^-y dx dz), 

where x dx -\-y dy + z dz=^0 

and • z d^x 4* dx^ 4- dy^ 4- dz^ = 0, 

where the given relation determines a; as a function of y and z. 

eliminating# c^*a:, we have 

d^ V = 8{2z dx dy 4- 2a; dy dz -\-2y dx dz - ^ {dx^ 4- dy^ 4- dz^)). 


On 


where 


At the point where x — y^z^ 


xdx^-y dy^zdz^^. 
a 


8a 


73 ’ 


d* F = (2dx dy 4- 2dy dz 4- 2dx dz - dx^ - rfy* - dz-), 


subject to 


73 


da;4-dy +dz=0. 
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For a relative maximum of F, the quadratic form 
- dx^ dy +dx dz 

+dx dy -dy^ +dydz 
+ da; dz+dy dz -dz^ 

must be negative definite subject to dx ■\-dy -{-dz^O. The conditions for 
this are satisfied since 


0 

1 

1 «4>0 and 

0 

1 

1 

1 

-1 

1 

1 

-1 

1 

1 

1 

-1 

1 

1 

-1 




1 

1 

1 


Hence, V « 


8a» 

3^3 


is the relative maximum value of 


1 

1 

1 

-1 

F. 


- 12 < 0 . 


19.4 The stability of demand for factors of production. 

The demands of a firm or industry for two factors of production 
(14.8 above) are stable if the constant product curves, obtained from 
the given production function, are convex to the origin at all relevant 
points. The conditions for equilibrium and stability of production 
can now be examined in more general cases. 

A good X is produced with n variable factors Ai, ... 

according to the production function x=f{aiy ag, ag, ... a„), which 
is assumed to have continuous partial derivatives of the first two 
orders : 

dx d^x 

Our first problem is, given the market prices pa> Pz* ••• Pn of the 
factors, to find the grouping which produces a given output x at mini- 
mum cost. We have to minimise Tl^OiPi+a^Pi-^a^Ps-i- ... -^a^Pn 
relative to /(Oj, a,, a,, ... a„)=x. The necessary condition is 

dfl =pi dcb^ + Pa ^2 + Pz ^8 + • • • + Pn dttn = 
subject to /i doi +/a da^ +/„ dan = 

/i /«■ 

There are (n-1) equations which, with the given side relation, 
determine the equihbrium employments of the factors in terms of 
the given output and the given prices of the factors. We thus have 
^'tlje demand for each factor and we can write 

= Pv Pz> Pi> — Pn) (r = l, 2, .3, ... n). 
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f 

Taking the side relation as giving % as a function of a,, a,, ... a„, 
then, differentiating twice, we have 

d^di +/u dui^ "^/22 dci^ + . . . + 2/12 ^1 du^ + • • • =0. 

But d^n^d{p^ da^^p^da^ + ... + jPn da^) -Pi d% 

~ {fii dcij^ -h ^22 dd^ + . . . + ^fi2 do/^ ddg ■+*••.). 

Ji 

For a minimum value of 77, we have d'^n>0 and 

/ii doj^ +/ia da^ da^ -h /13 da^da^ + ... ^/m da^ da^ 

4-/12 ddi da^ +/2a da^^ 4-/23 da^ da, 4- . . . 4-/2n da^, da^ 


4-/in da^ ddn +/2n da2 ddn ^fzn dd^ dd^^- ... 4-/nn da„» 
is a negative definite quadratic form subject to 

ddi 4-/a dd2 ^/s dd^ -f . . . 4-/n dd^ — 0. 

From 18.9, ( 2 ), a sufficient condition is 


0 

A 

A 

>0, 

0 

A 


A 

A 

fn 

Aa 


A 

fn 

fl2 

/l3 

/s 

/12 

fzz 


A 

fl2 

fzz 

Aa 





A 

/l3 

Aa 

Aa 


Suppose that/i, fzy fzi •••fn ^re all positive and that the inequalities * 
above are satisfied for all combinations of factors within a relevant 
range. Then, for dny set of outputs and prices of the factors (within 
a certain range), equilibrium is possible and the position determined 
is stable. We have the stability conditions for the demand for 
factors of production, conditions which are given as limitations on 
the form of the production function. For a production function 
satisfying the conditions, the demand functions for the factors 
are uniquely determined and stable. If there are only three factors, 
it is •easily shown that the stability conditions imply simply that 
the constant product surfaces in factor space are downward sloping 
and convex to the origin at all points, at least within a certain range. 

19.5 Partial elasticities of substitution. 

We require thp following notations which represent certain 
features of the production function x—f{d^, d^, d^, ... a„). For any 
combination of the factors, define 

dffr 

~ Ol /l 4 «l/i 4- . . . + ajn 


{r==l, 2, 3, ... n). 
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Hence, /c 2 + ^8 + *** = 

At an equilibrium position, f^yf^yfzt ••*fn o^re proportional to the 
prices pi, p 2 > Pzy ••• Pn of the factors. Hence, 




aiPi + a2P2 + ---+«n:Pn n ' 


i.e. Kf, is the proportion of total cost (11) which is spent on the 
factor Af. 


Let 




^ fi A fs 

fl fll fl2 /l3 

fz fl2 /22 /23 


I fn fin fzn /31 


fn 

fin 

fzn 


fn 


and let F^^ denote the co-factor of/,., in (r and 5 = 1, 2, 3, ... n). 
Write ^ + ^ 2/2 + • ♦ • + ^ 3 = 1 ^ 2 , 3, ... n). 


Q/fCt , 


For any unequal values of r and 5 , the value of a,., is called the 
'partial elasticity of substitution of the pair of factors and A, (as 
against all other factors). Its value depends on, and varies with, 
, the grouping of factors employed. The interpretation of the partial 
elc^^ticities of substitution will appear in the following section and it 
can be shown that they are related to the ordinary elasticity of 
substitution between two factors as defined in 13.7 above (cf. 
Examples XIX, 9). 

A number of relations exist between the values of a,., for various 
values of r and s. If each element of the first row of F is multiplied 
by the co-factor of the corresponding element of another row, then 
the sum of the products is zero (18.4 above). So 

flFn+f2^r2+fsFr9 + —+fn^rn = 0 (r = 1, 2, 3, ... n). ' 
From the definitions above, we have 


_ fl^rl ^ a — 

^ a^F ’ * a^F ’ 

So KiGri + K2Cr^2 + + . . . ^n<^rn 

We can write this relation in the form 



= 0 (r = l, 2, 3, ... n). 


"t^2^r2 *+■••• + + ^r+l<^fr+l + ... “ ^r^rr* 

’The last two inequalities of the stability conditions (19.4) imply that 
F and are of opposite sign. The same is true of F and F^r for 
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any value of r.* But a^r is proportional to the ratio of to F 
and so 

arr<0 (r = l, 2, 3, ... n). 

Hence, for a production function satisfying the stability conditions, 

^l^rl ^2^r2 • • • “H + ^r+l^rr+1 ■+■•••■!• ^n^rn ^ ^ 

for any value of r (r = 1 , 2, 3, ... n). In any case, there are n relations 
limiting the values that can be taken by the partial elasticities of 
substitution for any grouping of the factors.t 
The values of the partial elasticities of substitution can be positive 
or negative. But the limitations above show that the positive values 
must be more numerous or important than the negative values. In 
particular, the (w 1 ) partial elasticities of substitution between any 
one factor and the others cannot all be negative. If there are only 
three factors and ^ 3 ), then 

^ 2^12 “i" ^8^13 ^ ^ 1^12 "t" ^8^23 ^ ^ and “f" #^2^23 ^ 

It follows either that all three partial elasticities of substitution 
(cti 2 , <ti 3 and <t^^) are positive or that one of the partial elasticities is 
negative and the other two positive. 

19.6 Variation of demand for factors of production. 

It is now assumed that there are constant returns to scale in the 
production of the good X, the production function x=f{ai Ug, a 3 , . . . a„) 
being linear and homogeneous with the properties set out in 18.5 
above. Given the output (x) and the prices (pi, Pz, P 3 , ... Pn) of the 
factors, the demands for the factors are determined by 

A A /a /» ’ 

and « 3 . ••• On) =»• 

The average cost of the output x is 

^_ 0 lPl+ 02 P 2 + ---+ 0 nPn ^^ 0 l/l+ 02/2 + ---+ 0 n/n _^ 
XX X ' 

Bmceai/i+Oa/ 2 +as /3 + ...+a„/„=a:by Euler’s Theorem. Further, 

* The order in which the factors are enumerated is immaterial ; we can 
take the factor last just as well as the factor 
• t The other conditions of stability give rise to further limitatioAS on^ho 
partial elasticities of substitution. 
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if the output varies while the prices of the factors remain fixed, the 
marginal cost at the output x is 


m 

dx 


dCln 


ddi 

dx 


dct^ 

dx 


dx 




dx 


\ ( f , / ^2 , / ^^3 , , / \ 

^ ^ ^ J 


since /, ^ + /j ^ +/8 ^ 4 - . . . 4 -/^ ^ from the production func- 

dx dx dx dx ^ 

tion. Average cost and marginal cost are equal and, therefore, 

constant for all outputs. 

Extending the problem of 14.8 above, it is assumed that X is sold 
on a competitive market at a price p equal to the constant average 


cost. Then the common equilibrium value of 
equal to p, i.e. ” 

Pr^Vfr (r=l, 2, 3, ...w), 


IS 


which is the law of ‘‘ marginal productivity Let the demand of 

n dx 

the market for X be given by a?=^(p) with elasticity 17 = 

Then the position of competitive equilibrium is described by ^ 

/(Oj, a2, ctj, ... ctn)^^{p)f 


and 


fl /2 /s fn 


These equations determine the values of a^, ag, ag, ... a„ and p in 
terms of the given prices of the factors. The output x = <f>(p) and 
the total cost i 7 =xp=aiPi 4 -a 2 F 2 + ® 8 l ^8 + *** +®nPn fl-re also given 
in similar terms. 

The demands of the competitive market for the factors are deter- 
mined, by the equilibrium conditions, as functions of tlje prices of 
the factors. Let one of these prices (pi) vary while all the other 
prices remain fixed. The resulting variations in the demands for 
the factors are then to be found by diJBFerentiating the equilibrium 
equations (which hold for any prices of the factors) with respect 
to Pi. From /(oi, o„ o„ . . . o„) = ^ (p), 

* . ^ / 3og , , , aa„ X dp 
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From pfi=Pi, ... pfn=Pn, 


f (f 


+ f +f — " 

dpi^ dp, 




Hence 


dpx ^ V“ dp, dp, ■*■ - dp,) 

^.f ?£?4. .f 

•^" 9Pi ^ V 9Pi dp, ■” ai)i/ ■■®' 

( 1 3i)\ . Sai - Sa- , dUn ^ 

• • * 

/« 


\p dp,) 


del, ^ ^ dd^ . ^On 


F stands for the determinant already written (19.6 above) and 
Fj is the co-factor of the element 0 in F. Further, F„ and F#,, . 
denote the co-factors of the element /„ in F and F# respectively 
(r and « = 1, 2, 3, ... «). Then, by 18.6 and 19.6, 


Fo = 0 , F„=^FK,cr,,=-j^F/c,a, 


T|T dfdg -p 


7.^*'"" /r 

where a^, is the partial elasticity of substitution of and A, and 

«r/r ^rPr 


ic, = - 


'Ol/l+« 2/8 + — +«n/n 

is the proportion of total cost (= total receipts) spent on A,. 

Usmg the determinant notation (18.6 above) to solve the linear 

equations ^bove for ^ , we find 

da, 
dp,' 


X7J 

0 

ft 

••• fn 


a;’? 

fl 

ft 

••• fn 


1 




/i 

fll 

fit 

• • • fin 

fi 

• 3 

P 

fl2 

••• fin 


ft 

fit 

ftt 

••• ftn 

ft 

0 

ftt 

••• ftn 


fn 

An 

ftn 

••• /nn 

fn 

0 

ftn 

• • • fnn 
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With the aid of 18.3, (6), the numerator can be written 


fz ••• fn 

-if 

^ A ••• fn 

+ 

^ Si • 

..fn 

A /22 An 

p \ 

fl fll ••• An 


^ fll • 

. fin 

fn An ••• An 


fn fin ••• An 


0 fin . 

.. fnn 




since pfi =Pi and = 

^ fi A 
fl fll fl2 
A /l2 /22 


<^iPi 

xp 


The denominator equals 


I fn fin f^n 
Hence, 


fn 

+ 

xr^ 

A 

A 

fn 

= F + X7}Fo 

fin 


0 

fll 

fll ••• 

fin 


fin 


0 

fll 

fll ••• 

fin 


fnn 


0 

fin 

An ••• 

fnn 



/ \ 

^Pi Pi 


i.e. 


Pi da 


-=^i(<^ii -’?)• 


4-g_Pida2_ . 

■ air ' 


l) 


Ea^ 

Epi ai dpi 

Proceeding in exactly the same way, we find 
Ea^ 

Epi a^dpi 

aqd similar results for the demands for other factors. 

The results, though expressed in terms of variations in the price 
Pi, are clearly quite general. We can, in fact, write one formula to 
express the effect of a change in the price of any factor on the demand 
for any factor : 


Epr 


-7j) (r and ^ = 1, 2, 3, ... n). 


These expressions for the partial eleisticities of demand for the 
factors are of the same form as those obtained in the case of two 
factors (14.8 above). Their interpretation proceeds as before but 
some new conclusions are now reached. If the market price of one 
factor rises, then the demand for this or any othpr factor is affected 
in two ways. Firstly, the cost of production is now higher and the 
product dearer. For a decreasing demand law (97 > 0), the amount of 
the product sold is less and there is an all-round and proportional 
deoreaae^in the demand for the factors. This effect is shown by the 
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• 

negative term ( - K^r)) which appears in each of the elasticities (fi 
demand with respect to the price of A^, Abstracting from this effect, 
the factor -4,^ is now relatively more expensive than other factors 
and it pays to substitute other factors for -4,. in production. The 
demand for A ^ thus decreases on account of substitution, as shown 

Ea^ 


by the negative term in 


Ep/ 


(By 19.5 above, is negative 


for stable demands.) The effect on the demand for one of the other 
factors is more complex but the net result (on the demand for A ,) 

Ed 

is shown by the term («rra„) in j~ . There are two cases : 


(1) If a ,.,>0, then the demand for A^ increases on account of 
substitution ; the factor A, takes part in the replacement of in 
production. In this case, tlie factor A, is said to be competitive 
with the factor A,, at the grouping of factors considered. 

(2) Ifa„ < 0, then substitution results in a decrease in the demand 
for 41, ; the factor -4,, like the factor A^y has been partly replaced 
by other factors in production. Here, the factors A^ and A, are 
said to be complementary at the grouping considered. 

The sign of a,.,, therefore, indicates whether A^ and A^ are com- 
petitive or complementary factors. From the limitations described 
in 19.5, we see that competition between factors is, on the whole, 
more general than complementarity. One factor, in any case, cannot 
be complementary with all other factors. Where there are only 
three factors, for example, either all factors are competitive with 
each other or one pair of the factors is complementary while the 
other two pairs are competitive. 


19.7 Ifhe demand for consumers’ goods (integrability case). 

We have considered (14.9 above) the demands of a consumer with 
a given income when there are two goods obtainable on a market at 
given prices. The demands are stable if the indifference curves of 
the consumer are downward sloping and convex to the origin at all 
relevant points. Irj extending the analysis to the general case where 
there are more than two goods available, we assume first that the 
scale of preferences of the consumer can be represented by an in- 
difference map and a utility function. Complications relating to 
integrability ” are deliberately avoided in this first appros«3h. ^ 
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If there are n goods X,, X„ ... X„ let one form of the utility 
function of the consumer be 

a:,, ... x^) 

with continuous partial derivatives of the first two orders i 

, du , , d^u , - , ^ « 

and <f>r.=^^ (rands = 1.2.3....n). 

The consumer has a given income fi and can purchase the goods at 
given market prices, Pn P29 Pa •••Pn- He makes his purchaises so 
that u has a maximum value subject to the condition expressing the 
fact that he must balance his budget : 

^iPl + ^ 2 P 2 + ^zPz + ••• + ^nPn 

Since each utility function increases and decreases with any other, 
a maximum of u corresponds to a maximum of any form of the 
utility function and our resiilts are thus independent of the fact that 
utility is not a measurable concept. The necessary condition for the 
maximum value of u we seek is 

dw = <^ dxi + <l>2 dx2 + ^3 dx2 + ... + dXn = 0, 

subject to Pi dXi -f Pj ^^2 -^Pz d^Z “h . . . +Pn 

ie =^. 

Pi Pz Pz Pn^ 

These (n- 1) equations, with the given side relation, determine the 
Equilibrium purchases of the consumer in terms of his given income 
and the given prices, i.e. the consumer’s demands as functions of 
P'* Pli PZf Pzf ••• Prf 

The condition of “ budget balance ” gives as a linear function 
of the other purchases and so d^Xi:=^ 0 . It follows that a sufficient 
condition for maximum u is that 

dht = (f>ii dx^ + <f>i 2 dxi dx 2 + ^13 dxi dx^ + . . . + dxi dx^ 

-f <f>i 2 dXi dX 2 + ^22 ^^2^ + ^23 ^^2 ^^3 + • • • + ^2n 

+ 

+ An dXi dx„ + An dXi dx„ + An dx^ dXn + . . . + An dx„' 

is a negative definite quadratic form subject to ‘ • 

Pidxi + p 2 dxt+ psdx, + ... + p„ dx„ = 0, 

^i.e. to Adxi + Adx 2 + Ad^a + -•' + Adx„=0, 

(using the necessary conditions) 
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•» 

Hence, by 18.9, ( 2 ), 

9 (f>i (f>f ^9, 0 ^ <f>^ “^9, ••• • 

^ <^11 ^12 <^8 

i>i ^12 ^22 ^2 ^12 ^22 <^23 

<^8 <^13 <^23 ^33 

The equilibrium position and the demand functions are stable for 
any income and market prices if <f>i, ^ 3 , ... <f>n are positive and if 

the determinant inequahties above are satisfied for all purchases cf 
the consumer, at least within a relevant range. These stability 
conditions, it should be noticed, are limitations on the form of the 
utility function. When there are only three goods, the conditions 
imply that the indiflference surfaces of the consumer are downward 
sloping and convex to the origin at all points, i.e. the “ normal 
form of the indiflFerence map is sufficient for stabihty of demand. 
The stability conditions appear in a form which involves a parti- 
cular utility function and it is not clear that they are independent 
of the non-measurable character of utility. We can, however, trans- 
late the conditions so that they involve only the marginal rates of 
substitution between the goods (16.8 above). The ratios 

4>i' 41 7 4i 

express the marginal rates of substitution of for X,, X,, ... 
respectively. The necessary conditions for equilibrium show that 
these ratios are equal to the corresponding ratios of market prices, 
i.e. the equilibrium purchases of the consumer are such that each 
marginal rate of substitution equals the ratio of the prices of the 
goods concerned. Now 

^ _ 4i24i ~ 4n42 ^ f 4^ _ 42z4i ~ 4i24% 

, 4i^ ’ dx^\<f>J 

and similarly for the partial derivatives of the other marginal rates 
of substitution. The determinant inequalities of the stability con- 
ditions, using these results, can be reduced to 
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'l^he stability conditions thus involve only the marginal rates of 
substitution and are independent of the actual form chosen for the 
utility function. 

Exactly as in 19.5 above, wc define 

_ ^1^1 ■4'ir2^2 ■!"••• 

(r and 5 = 1, 2, 3, w), 

where 0,., is the co-factor of in the determinant 
0 = 1 0 (f>i <f>2 <f>s ••• 

01 011 012 013 ••• 01n 

02 012 022 023 ••• 02n 

I 0n 01n 02n 03n ••• 0nn 

At the equilibrium position, k^, represents the proportion of total 
expenditure {fi=XiPi + X 2 P 2 + +^nPn) which is spent on the good 

X^, The value of is defined as the partial elasticity of substitution 
of the pair of goods and X, in consumption. 

The demand of the consumer for each good is defined as a function 
of /X, Pit p 2 t Pti ... Pn by the conditions 

Pi P2 Pz Pn 

aijd X^i + X2P2 + ^ZPZ + • • • + ^nPn =fX. 

The parameter A (dependent only on /x, p^t Pz> ••• Pr^ is introduced 
for convenience and often described as the “marginal utility of 
money Following the method of 19.6 above, these equations can 
be difiFerentiated with respect to /x or to one of the prices to give the 
variations of demand for any good : 


Ex, udx, , Ex, Prdx, 
-pr-^ = — — and 

Efi X, ofjL Ep^ X, apr 


(r and 5 = 1, 2, 3, ... 



It is found (see Examples XIX, 17 and 18) that 


(r and 5 = 1, 2, 3, ... n), 

* ^ 

It follows that the demand for each good is affected in two ways 
by an increase in the price of any one good (X,.). Since the money 
Income of the consumer is fixed, the increase in the price results in a 
lowlr re&l income and causes a change (usually a decrease) in the • 
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purchase of each good. This is shown by the term ( - f<r in the 
Ex \ Efx / 

* Further, the good Xf is now relatively more 


expression for 


Epr' 


expensive than other goods and the consumer proceeds to substitute 
other goods for in consumption. The demand for X^ is thus 
decreased, as shown by the negative term in the expression 

Ex 

for , (The value of a^r is negative by the stability conditions.) 

The effect of substitution on the demand for one of the other goods 

Ex 

(X,) is represented by the term in 77 -^ . The consumption of 

Epr 

Xg increases or decreases on account of substitution according as 
this term is positive or negative. Hence, and X, are competitive 
in consumption if or,.,>0 and complementary if cr,.,<0. As in the 
production problem, the stability conditions impose limitations on 
the values of a,., (r and 5 = 1, 2, 3, ... n) which imply that the com- 
petitive relations between the goods outweigh, on the whole, the 
complementary relations.* 


19.8 Demands for three consumers' goods (general case). 

In generalising the results of the previous section, we must assume 
that the preference scale of the consumer is defined, not by a utility 
function and a complete indifference map, but only by the marginal 
rates of substitution between the various goods at different levels of 
consumption. To simplify the exposition, we take the case where 
there are three goods X, Y and Z and make use of the notation and 
results of 16.8 above. Each possible set of purchases of the con- 
sumer is represented by a point (x, y, z) in space referred to axes 


* We have given here definitions of competitive and complementary goods 
which are ^jiore strict than those previously suggested (12.6 above). On the 

old definition, X- and X, are competitive goods if — * and are both 
Ex f Ex \ 

positive. From =r-^ = ac, ( < 7 -. - ) , we see that the new definition (a,., > 0) 

EPr Ex, , Ex, 

usually implies tlfb same thing. But one or both of and can be 

negative if or is positive and greater than a,.,, i.e. if the effect on 
Eijl Efi 

demand via changes in real income is stronger than the substitution effect. 
Similarly, the two definitions of complementarity are roughly, but not ex^igtly, 
in agreement. 
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Ox, Oy and Oz. The preference scale of the consumer is then 
described by an indifference plane dx-\’Rydy-{-R^z=^0 at any given 
point of space. Here, R^ and R^ are given functions of x, y and z 
(the purchases of the consumer) which express the marginal rates 
of substitution of X for Y and Z respectively. The functions are 
assumed to have continuous partial derivatives. The differential 
equation of the indifference plane is not necessarily integrable and 
we are not at liberty to assume that a utility function or a complete 
system of indifference surfaces exists. 

With given market prices, p., py and of the goods and a given 
income /x, the purchases of the consumer must conform to the con- 
dition of ‘‘ budget balance ** + + and any variations 

in the purchases from an established set {x, y, z) must satisfy 

Pa; dx dy -hPs dz = 0. 

If the purchases of Y and Z are increased by dy and dz respectively, 
the necessary decrease in the purchase of X is 

{-dx)i,=^dy+^dz. 

PoB Pm 

f 

The indifference equation shows that the compensating decrease is 

( dx)c=Ri dy + R^ dz, 

* 

If the necessary decrease is less than the compensating decrease, 
then the consumer tends to increase his purchases of Y and Z as 
indicated. Conversely, if the necessary decrease is greater than the 
compensating decrease, the consumer wiU tend to decrease his pur- 
chases of Y and Z, In fact, equilibrium of consumer’s choice is 
only possible if the necessary and compensating decreases are equal 
for all values of dy and dz. Hence, the necessary conditions for 
equilibrium are 

R^zJ^ and jB 2 =— 

P» Pm 

and the marginal rates of substitution, as before, rrf equal to ratios 
of the market prices. 

To examine the stability of demand, we suppose that increases 
iy and dz in the purchases of Y and Z have been made from 
the equilibrium position, together with the necessary decrease 
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f — dy-^^dzj in the purchase of X, The new values of the marginal 

rates of substitution are approximately {R^ + dR^ and 
where 


J D ^R\ J dRi J dR^ J J J n ^R^ J ^^2 J ^^2 j 

dRt = dx 4“ dy 4- ^ dz and dR^^ • 

ax ay dz * dx dy dz 


Suppose, now, that an exactly similar change from the new purchases 
is contemplated. The necessary decrease in the purchase of X is 


(-dx)2if——dy-\-—dz^ 

Pm Pm 

but the compensating decrease in X is now 

( - dx)c = {Ri 4- dRj) dy 4- (iZa 4- dR^ dz. 

The original purchases are stable provided that this second change 
will not be made, i.e. provided that the compensating decrease in X 
is less than the necessary decrease : 


(jRi 4- dRy) dy 4- (iZg + ^^2) dz dy dz==R^dy R^ dz, 

Pm Pm 

i.e. dRi dy 4* dR^ dz < 0, 

where the variations in purchases are related 

Pa; dx -{-pydy-i-pgdz = 0 or dx + rfy 4- R 2 dz = 0, 
using the necessary conditions. Substituting for dRi and dR^, 




i J. + rp ^ <fo iy + rp <fo * + 


dy dz 


dydz<0, 


I.e. 0dx* + -^dxdy +--^dxdz 

2 dx 2 dx 

IdRi. , ai?, , , 1 (dR^ dR^\, , 

}}R,. . i/3R, 3R,\ , , 3R,,, 

+ 2 I + -3.- + IT* 

must be a negative definite quadratic form, subject to 
dx + Ridy + Rtdz=0. 
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Hence, from 18.9, (2), 




J = 


0 

1 

R, 

R, 



0 

1 

R, 

>0, 


1 

0 

IdRi 




2 dx 



s. 

lai?, 

ai?i 



1 

2 dx 

dy 


1 


Ri 


R, 

n 


IdR^ 


lai?. 

vr 


2 dx 


2 dx 

l0iJl 


ai?, 

1/ 

dR,^ c 

2 dx 


dy 

2 \ 

.dy'^i 


1/ 

'dR^ dR, 


dRt 

2 dx 

2 ' 

\dy ^ dz 

7 

dz 


dz ) 


< 0 . 


The demands of the consumer, given by the necessary conditions, 
are stable for all prices and incomes if R^ and R^ are positive and if 
£>>0 and A<Q for aU purchases. These are the general stability 
conditions. Since an indifference map need not exist, the conditions 
cannot be interpreted as before. 

< A certain simplification can be made i 


D= - 

1 i?i 

+ Ri 

1 

Ri 


1 dRi dRi 


0 

IdRi 


2 dx dy 

1 

2 dx 


dR. 1 „ 1 _ dR, 

= - It - + 0-^1 ^ + 0-^1 - 5 - ■ 
cy 2 ax 2 ox 


p ^Ri _ dRi _ 
' ^ dx dy~ 


Further, write 




1 

Rx 

R. 

dR^ 

dRi 

dRi 

dx 

dy 

dz 



dRt 

dx 

dy 

dz 


dx 


R, 

dy 


By manipulation or expansion of the determinant?i, it can be shown 


that 

J+J'=A* 

> 

WhoCQ ^ 

A = 4. 


2 [V ' ax * ax/ 


\dz 


dR. 


9y )}' 
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The stability conditions (Z)>0 and A<0) thus reduce to 

1 R^ >A». 

dRi dRi dRi 
dx dy dz 
dR^ dR^ 9i?2 

dx dy dz 

These conditions are very similar to those of the integrability case 
but the third-order determinant shown must not only be positive but 
also greater than a certain positive amount The conditions of 
the integrability case must still hold but they are only necessary, 
and not sufficient, for stability. Notice that the integrability con- 
dition (16.8 above) is simply A = 0 and that the general stability 
conditions then reduce to those previously written. 

The demand of the consumer for each of the three goods is deter- 
mined, as a function of /x, Pa-, Py and p,, by the conditions : 

^ 1 =^, ^2 = ^’ and xp^ + yp^ + zp, = fi. 

Px Px 

This is true of all cases. In the special integrability case, the forms 
assumed by the marginal rates of substitution and by the stability 
conditions are simpler than in the general case. The results (19.7 
above) relating to the variations of demand and the partial elas- 
ticities of substitution hold only in the simpler case and nee/1 
modification when we pass to the general case. In particular, the 
distinction between goods which are competitive and goods which 
are complementary in consumption is found to be less clear-cut in 
the general than in the special case.”** 

EXAMPLES XIX 

General maximum and minimum problems 

1. Show that u = X* + 2 /* + 2 * + XI/ has a minimum value at a? = y = ^ = 9. 

2. Show Ihat w = x* + 1/* + xyz has minimum values at all points where 
x = 7/ = 0 and z< 2 numerically. 

3. If w = X + 1 / + xz - X* - 2 /* + 2 *, show that u has a single stationary value 
which corresponds tp a saddle point. 

* An analysis of the competitive and complementary relations between 

goods and of the variations of demand in the general case are given by Hicks 
and Allen, A Reconsideration of the Theory of ValiiCy Economica, 1934 . 
Tile stability conditions given in this article are not fully expressed and^he 
^present development can be taken as replacing the earlier work. 


1 i?! < 0 and 

dRi 

dx dy 
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4« Show that u — Hxyz, subject to a; + s^ + 2 = 3a, has a maximum value 
(u = a) when x = y = z = a. Deduce that the geometric mean between three 
positive quantities is always less than the arithmetic mean except in the case 
where the quantities are equal. Generalise the result to apply to any number 
of quantities. 

5. Find the shortest distance from the origin of co-ordinates to any point 
on the surface xyz = a in the positive quadrant. 

6. If a is a positive constant, show that the maximum value of 

u = a(a - x){a - y) (a - z) 

relative tox-^y-\-z = 2a occurs where x, y and z are each equal to fa. Inter- 
pret this result in the light of the fact that the area of a triangle with sides 
X, y and z is given by 's/a (a - a;) (a - y) {a - z), where 2a is the perimeter. 


Economic maximum and minimum problema 

7. A monopolist produces three goods X, and X,. The total cost of 
outputs a;, and of the three goods is -f a,a;, -f oc^x^), where aj, a, and 

«, are constants. The monopolist charges prices p^, and p^ and the demands 
of his market are 


*1 = *1. - “u (Pi - *l) - (p* - - «1. (P. - «.). 

», = *,. - o„(Pi - (Xi) - o„(p, - o,) - o„(p, - a,), 

*, = *„ - a„ (pi - a,) - a„ ( p, - a,) - a,, (p, - a,), 
where ajjp, a;,p, a?,p and the a’s are given constants. Find the prices the mono- 
polist must charge for maximum joint revenue and show that the following 
conditions must be satisfied by the o*s : 


"ll 

Oli 

>0 and 


On 

Oi, 

«11 

o** 


Oil 

0,1 

On 




Oil 

On 

On 


Extend the results of 14.4 above. 


8. Find the conditions under which the equilibrium of the general problem 
of 14.6 above is stable, expressing them in terms of the partial derivatives of 
the production function a;=/(a, b, t), 

9. A good X is produced with three variable feictors according to the 
production function x=f(ai, a„ a,). Show that vi, (as defined in 19.6 above) 
is a negative multiple of the ratio 



where the differentials apply to a variation of the factors A, and (the 
factor A I being held constant) so that product is unchanged. Express this 
result in terms of an elasticity of substitution of the kind defined in 13.7 above. 

10. Three factors are obtainable at given prices, p^, p, and p,. When 
amounts o^, a, and o, of the factors are employed, the putput of a good X is 
x = At/a^a^a^, The good is sold at the competitive price p on a market with 
demand law x = Bp~^. Find the amoimts of the factors demanded in terms 
of p^, p, and Pg. By differentiation, show that 

Ea. 1 . Ea. Ea. 1 
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r 


11. For the production function of the previous example^ show that the 
demands for the factors are stable and verify that the partial elasticities of 
substitution are all equal to unity. Hence use the results of the previous 
example to check the formulae of 19.6. 

12. A good X is produced with n factors of production, the production 
function, x=f{a^f a„ a,, ... o„), being of any given form. The equations 


and /(a„ o„ a„ ... aj = a; 


: (r and « = 1, 2, 3, ... n) 


A A A /n 
give the demands for the factors in terms of the given output (x) and the given 
prices (p^, p,, p,, ... p^) of the factors. By differentiating these equations, 
show that the variations of the demands are given by 

S=J* (•=L2.3....n) 

when output varies (prices of the factors fixed), and by 
Ea^_p^ da^ 

when the price of one factor varies (output and prices of other factors fixed). 
Here, F, is the co -factor of in the determinant F and and a,., have the 
meanings of 19.6. Interpret the second results in terms of the competitive 
and complementary relations between the factors. 

18. A monopolist sells a good X at a price p on a market with demand law 
ajrr<^(p) and fixes output so that net revenue is a maximum (i.e. so that 
marginal revenue equals marginal cost). He uses n factors of production, 
obtainable at given prices ( Pi, p,, p*, Pn)» his production function, 
a;=/(a,, a,, a,, ... a„), is linear and homogeneous. Show that the price p and 
the amomits of the factors employed are given by , 

/. /. /.' 

where t) is the elasticity of the demand for the good. 

14. In the problem of the previous example, the price of the factor A, is 
varied while the prices of other factors are fixed. Differentiate the equilibrium 
equations and show that 

Eaf_p^ da^. 


and /(a„ o,, o„ ... o„) = ^(p) 

Jn ^ V' 


Epi a, ap. 


1 + - 


rf{rj- l)dpj 

where and ctj,. have the meanings of 19.6. Deduce that, in the normal case 
where t;> 1 fyid ^->0^ a fall in the price of increases the demand for each 
factor less in this monopoly case than in the case of competition (19.6 above). 

15. A consumer has a given income (p) and can buy three goods at given 
prices (Pi,p, andp ^.« One form of his utility function is w = where 

otj, a, and a, are positive constants. Find the demand of the consumer for 
each good in terms of p, p^, p, and p„ and show that 

Ex. Ex. _ 

i ss i 0. 

Ep. Ep, 

with similar results for the other two goods. 


^=.1 


Ex^ . 
Ep, ’ 
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16. Show that the utility function of the previous example satisfies the 
stability conditions and gives partial elasticities of substitution which are all 
unity. Use the results of the previous example to verify, in this case, that the 
general formulae of 19.7 hold. 

17. The equilibrium equations of 19.7 give the demands of a consumer as 
functions of his income (/x) and of the prices (p^, p,, Pj, ... p„) of the n goods 
obtainable. If income varies while prices remain fixed, differentiate the 
equations with respect to p and show that 

,x Sx^^ x^^^+x^^^ + ...+x„ ^„4>r (r=l 2 3 . ») 

Efi dfjL 0 ^ * * * *** ' 

where the utility function and the notation of 19.7 is adopted. 


18. In the problem of the previous example, the price of the good X, varies 
while income and the other prices remain given. By differentiating the 
equilibrium equations with respect to pj, show that 


Exr_p^^x, / Ex;\ 




n) 


where and have the meanings defined in 19.7. 


19. The scale of preferences of a consumer for n goods is defined by the 
differential equation 

(a^ 4- a^^Xi) dx^ + (a^ + a^^x^) da;, + . . . + (a,, + Onn^n) ^ 

where x^, a;,, a;,, ... represent any set of purchases made by the consumer 
and where the a*s are constants. Show that a utility function exists and 
verify that the demands of the consumer are stable, provided that ajj, a,,, 
o,,, . . . are all negative. Find the demand for each good as a function of 
the given income of the consumer and of the given market prices. Show that 
the demand depends linearly on the income (prices fixed) and is a ratio of 
quadratic expressions in the prices (income fixed). 


v20. A firm with given resources (labour, plant, raw materials, etc.) produces 
n goods and its outputs of the various goods are related by a transformation 
function F{x^f a;,, a;,, ... a;„) = 0. The goods are sold at given market prices, 
Pi, p,, Pg, ... p„, and the firm fixes the outputs to maximise its revenue 
2? = a;iPi +a;gPg + ... +a;^p„. Find equations giving the amoimt of each good 
supplied as a function of all the market prices. Under what conditions is the 
supply stable ? How can partial elasticities of substitution between the goods 
•in production be defined and used to describe the variations of supply of 
different goods as the market price of one good changes ? 

21. Extend the analysis of the demand of an individual for loans (14.9 
above) to the case where £x^, £x^, ... are incomes obtained in n 

successive years, related by a transformation function F{x^, a;,, oe,, ... a;,j) = 0. 
Assume that the individual has a utility function for incomes which can be 
written u= a;,, a;,, ... x^) and that the rate of interest is lOOr^ per cent, 
from the first to the second year, lOOr, per cent, from the second to the third 
year, and so on. , 



CHAPTER XX 


SOME PROBLEMS IN THE CALCULUS OP VARIATIONS 
20.1 The general theory of functionals. 

We have been concerned so far with the theory of functions of a 
finite number of variables, wdth the variations of quantities depend- 
ing on one or more other quantities. There remains for brief con- 
sideration an extension of the analysis, opening up many new fields 
of apph* cation, which takes a quantity as dependent not upon a 
finite set of other quantities but upon one or more variable functions. 
In the simplest case, a variable quantity u is defined as taking 
its value from the form assumed by a function x = <f>(t). To each 
function <f) (t) there corresponds a definite value of u, and as the form 
of the function is changed so is the value of u. The dependence of u 
upon <f> {t) is called o, functional and written u^F{(f)}. The functional 
sjunbol F is to be regarded in a manner analogous to the ordinary 
symbol for a function. It is essential, however, that the functionaji 
u=^F{^} should not be confused with the function of a function 
u^F{^{t)}. The latter assumes that is a given function of t 
and hence that u is also a definite function of t. The former takes <f> 
as a variable function, the functional F associating one value of u 
with each whole function The variable t does not itself appear 
in the .determination of u in the functional relation.* 

In diagrammatic terms, the variable function x = <f>{t) is shown 
by a variable curve C in the plane Oxt. As the form of the function 
changes (^, ^ 2 » •••)> curve C shifts and takes up different 

positions and shapes (Ui, Cg, Ug, ...). If is a functional of then 
its value depends pij the particular position taken by the curve C , 

• More generally, u may be a functional of several variable functions 
!C=^(0, 2 = x(0» ••• • write w = F{<^, 0, X» •••}• Further, the 

functions x» ••• be fimctions of several variables instead of one 

variable t only. For the general theory of functionals, see Vol terra, Theor^oJ 
Tunctionala (English Ed. 1931). * 
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and to the series of positions (Oi, ^8> •••) there corresponds a 
series of definite value of u (Ui, U 2 , ...)• For example, Fig„ 103 

shows five positions of a curve limited to pass through two fixed 
end-points A and B and there will be five corresponding values of u. 

An example of a fimctional can be taken from quite simple 
economic theory. Each member of a competitive group of firms 
producing a good X with known cost functions fixes that output (a;) 
which makes marginal cost equal to the given market price (p) of 



the good. Further, the output fixed by the firm varies when 
different market prices are given. A supply function^ x=f (p), can 
thus be defined for each firm. But if the firm has monopoly control 
and fixes output and price subject only to certain demand conditions, 
then the output is such that marginal cost equals marginal revenue. 
If the marginal revenue curve is given (given demand conditions 
and given prices of all other goods), there corresponds one definite 
monopoly output As the marginal revenue curve shifts in position 
(changing demand conditions or varying prices of other goods), the 
monopoly output is determined afresh and varies in value. The 
monopoly output x^ depends on the form of tlje^ marginal revenue 
function <l>{x) and we have the supply functional x^=F{<f}}* 

The step from functions to functionals can be regarded as another 
v instance of the step from the fim'te to the infinite so characteristic 

*^9 f 

^ See Schneider, Theorie der Produktion (1934), p. 74. 
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of mathematical analysis. If a;,, ... x^) is a function of a 

finite number (n) of variables, we can increase n indefinitely by the 
introduction of more and more variables. The function u then tends 
to involve an infinite number of variables. But this is just what is 
implied by saying that u depends upon the form of a variable 
function = The function, in fact, comprises an infinite 

number of particular values ...) and u depends, in 

the functional notion, on the whole of this infinite set of values. 
The fimctional u=^F{<l>} implies that u depends on the infinity of 
values making up the function a; = ^(<). 

20.2 The calculus of variations. 

The most important and frequent case of a functional u = F{<^} 
occurs when the form of F appears as an integraL Suppose that 
f{t) is a function changing in form as (f> (t) changes. Write 



where and are certain limits of integration. Then the value 
of u depends on what particular form we take for <f> (1) and hence for 
f(l), i.e. w is a fimctional of the variable function <^. The problem * 
that usually arises is to determine that function ^ which makes a 
maximum or a minimum. The analysis of this problem is termed 
the calculus of variations^ one branch of the much wider theory of 
functionals.* Many problems of importance in the varied applica- 
tions of mathematical analysis are found to relate to the calculus 
of variations. Two examples will illustrate this fact : 

Ex* 1. The problem of the surface of revolution of minimum area, 

A problem arising early in the development of the calculus of variations 
is that of determining that curve of all curves joining two fixed points 
A and B on the same side of a given line L which forms a surface of the 
smallest ar^ when revolved about the line L, Axes Ooct are fixed in any 
way so that Ot lies along L and so that the fixed points have co-ordinates 
A (^ 0 , a^o) ^i) which are all positive and ti> If x^<l>(t) is any 

curve lying above ^ and joining A and B, then it can be shown that the 

♦ Most text-books on advanced mathematical analysis include an account 
of the calculus of variations. See, for example, Courant, Differential and 
Integral Calcidus, Vol. II (English Ed. 1936), Chapter VII ; Osgood, Advanced 
• Calculus (1926), Chapter XVII ; de la Vall6e Poussin, Cours d'arud'jffse infini- 
UsimaU, Vol. II (1926), Chapter X. 
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area of the surface formed by revolving the curve about Ot is propor- 
tional to 



Here is a functional of the variable function </> and we seek that function 
(subject to the limitations indicated) which makes u a minimum. 

Ex. 2. The brachistochrone problem of John Bernoulli. 

One of the first problems in the calculus of variations, posed by John 
Bernoulli in 1696, is to fix a curve joining two given points 0 and A so 

that the time taken by a particle to 
slide under gravity along the curve 
from 0 to ^ is least. Axes Oxt are 
chosen, as shown in Fig. 104, with 
origin at the given point 0 and with 
Ox drawn vertically downwards. Let 
A have co-ordinates x^) which 
are both positive. If x=^</){t) is the 
equation of any curve joining 0 to 
A so that ^(0) =0 and =Xi, then 
it can be shown that the time taken 
by a particle to slide under gravity 

from 0 to A is proportional to 

ilere, again, w is a functional dependent on the form assumed by the 
variable function <f> and we seek that form of <f> which corresponds to the 
minimum value of u. 

20.3 The method of the calculus of variations. 

Two important preliminary points must be considered in devising 
practical methods of solving problems in the calculus of variations. 
The first point concerns what can be called boundary conditions. 
The problem to be solved is usually framed so that pnly certain 
arcs of the variable curve x = <f){t) in the plane Oxt are needed, 
definite conditions being imposed upon the points which mark the 
ends of the arcs. In the most frequent case, the conditions are that 
the arcs should start and finish at two fixed points A and B in the 
plane. The boundary conditions, in such a case, impose limitations 
bn^the field of possible variation of the curve x==<l){t) and we have 
to consider, in fact, only those curves which can be drawn from* 
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A to JS. This is so in the two problems instanced above and it is 
illustrated by Fig. 103, In analytical terms, the function x = <f>(t) 
can only be selected provided that 

and 

where A (to, Xq) and B (^, x^) are the two fixed end-points. Other 
boundary conditions may be imposed in less usual cases. It may be 
given, for example, that the variable curve = should join two 
points P and Q which he one on each of two given curves in the 
plane Oxt, Here, we have the boundary conditions <f)[t^=Xo where 
(^ 0 , x^ are the co-ordinates of P and satisfy some given relation 
^(to, = ^ J and similarly for the co-ordinates (ti, x^ of Q, 

Even when the boundary conditions are taken into account, the 
field of possible variation of the function x==(f>{t) is so large that 
analysis of the problem is practically impossible and it becomes 
essential to Umit the field of variation by some further device. An 
obvious step is to take only those functions, satisfying the boundary 
conditions, which are continuous and possess continuous derivatives 
up to any desired order. Further, the field can be more severely 
hmited by taking functions only of a particular type or curves of a 
particular class. We may take, for example, only functions of the . 
quadratic type represented by parabolas with vertical axis. We 
know that a function type or curve class can be represented by a 
relation involving certain parameters a, j3, y, ... ; the larger the 
number of the parameters the more general is the function type or 
curve class. The result of this limitation on the field of the variable 
function is to replace the function x = ^(l) of variable form by 

x = <f>(t ; a, p, y, ...), 

where ^ is now of fixed form and the variation of the function is 
replaced by the variation of the parameters involved. 

It is important to appreciate the nature of the step now taken. 
We have gjven up the consideration of any comprehensive variation 
in the function x = and have limited omselves to a more re- 
stricted variation described by parameters in a function of fixed 
form. If the parameters are few in number, the restriction is very 
severe. For example, using only three parameters, we may write 

<f>(t) = a/* + + y 

so that our variable curve is limited to the class of parabolas with 
* their axes vertical (parallel to Ox) But, by taking more and more 
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parameters, we can make the field of variation of our function type 
more and more general. K a sufficiently large (but finite) number 
of parameters is selected, the restricted field of variation can be 
made to differ in few important respects from the complete field ; we 
simply exclude from the latter the more unusual kinds of functions. 

The problem to be solved is now greatly simplified. We seek the 


extreme values of f(t)dt where f{t) depends on the variable 

function x^(f>{t). Limiting the latter in the way indicated, the 
expression to be integrated becomes /{( ; a, jS, y, ...) where a, j8, y, ... 
are parameters and where the form of the function / is fixed. It 
follows that the value of the integral u depends only on the para- 
meters ot, Pf y, ... and we require its maximum or minimum value 
for aU variations in the parameters allowed by the boundary con- 
ditions. The problem of the calculus of variations is thus reduced 
to a problem of extreme values of an ordinary function of several 
variables a, j3, y, ... . Functionals are changed back into functions 
and we proceed on familiar lines.* The extent to which the simplified 
problem approximates to the original one depends on the number of 
parameters we care to take. The most important thing about the 
ai^alysis which follows is that it is quite independent of how many 
parameters there are, provided only that their number is finite. 
The solution we obtain is not perfectly general but it provides an 
approximation sufficient for all practical purposes. 


20.4 Solution of the simplest problem. 

The function f{t), which gives the variable u on integration, 
depends in some definite way upon the variable function 
The dependence usually includes, not only ^ itself, but also the 
various derivatives of Of the many possible cases, only the 
simplest will be analysed here, the oaise where f(t) depends on the 
variable U on the function and on the first derivative : 

* The step from functions to functionals is reversed. A functional can be 
regarded as a function of an infinity of variables. In solving problems of 
functionals, we approximate by taking a function of a large number of variables 
(the' parapeters <x, y, . . . ). This is in line with the methods always adopted , 
for dealing with the infinite and the step made here is, after all, not unusual. 
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This is a funotion of t given in the function of functions form. Bht, 
since the funotion ^ included is of variable form, the form of the 
function / to be integrated is also variable. We assume, further, that 
the boundary conditions are such that the variable curve x = <f>{t) 
passes through two fixed points A (1^, Xq) and £(1^, x^). The problem 
to be solved is thus : 

The extreme values of the integral u = i: f{t. X, dt are required 

for all possible variations in the function x = <f>{t)^ such that 

where x^) and (^, x^) are fixed points. 


The problems instanced in 20.2 above are both of this form, 
first problem of the surface of revolution, we have 


/ 



In the 


and in the problem of John Bernoulli 

/(-.S)=vi{r7i7}. 

In solving the problem, we impose the limitations on the varia- 
tion of (0 already described, taking this function in the form , 

x = <}>{t ; a, y, ...) 

where ^ is a fixed function (with a continuous derivative) and 
where ocyp^y,.,. are parameters. Allotting arbitrary differential 
increments 8a, SjS, Sy, ... to the parameters, we derive first the 
corresponding variations Sx and 8x' in the function x and its 

derivative x' 

at 


and 



Tt should be remarked that all the variations here are ordinary 
differentials and subject to the usual rules of dififerentiatibn. They 



^ 8 ^ 
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are denoted by the symbol “ S ” in order to distinguish them from 
the other differentials, dx and dt, we obtain when we consider a; as a 
function of the values of the parameters being given. Hence, 
throughout our analysis here, “ d refers to variation in the variable 
t and “ 8 ’’ to variation in the parameters ct, P, ••• • 

The function /(/, x, x') and the integral u can now be considered 
as dependent on the parameters oc, y and the variations in 
their values are obtained as 




and 


8u=8{fV«, 

Now, using the result of 15.3 above, 

ms. -Hi m s.u]4 ffisx) -i m ax 

dt^dx j\dt\dx/ j J dt\dx ) dt\dx/ 


dfSf\^ dfd,^, dfdf\^ dfd,^, 
~dt [dx'J dt \dx') ^^~dx' 


Hence, except for the addition of an arbitrary constant. 



The expression for the variation in u then becomes 

Since, by the boimdary conditions, the curve x = (f){t) always passes 
through two fixed points at t = tQ and t = ti, it follows that the varia- 
tion of X is zero, i.e. Sa; = 0, at these points. Hence, 


and so 



8xdt. ^ 


The integral u is to have an extreme value for variation in the 
ftoction x = (f>{t) as obtained by varying the parameters 0 , j 8 , y, ... . 
The necessary condition for this is that Sii = 0 for all values of 
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Sa, S/3, Sy, , i.e. for all values of Sx. From the above expressibn 
for Su, this is only true if 

dx dt \dx' ) 


This result, known as Euler's equation, determines that function 

Pif 

x = <f>(l) which maximises or minimises the value of u. Since -- and 

df ^ 

are partial derivatives of /(/, x, x') and so functions of t, x and 

dx 

x' = — , the equation is a relation in a: as a function of t which involves 

CLv 


the derivative of x with respect to i.e. it is a differential equation 
which must be solved to give the function x=^(f>(t) we seek. Our 
problem in the calculus of variations is thus reduced to the relatively 
simple problem of integrating a differential equation. The methods 
for completing the solution are set out in Chapter XVI above. 
Euler’s equation, however, is only a necessary condition for extreme 
values of u. There remains the problem of distinguishing between the 
different extreme values. The number of variables (i.e. the para- 
meters a, y, ...) is here very large and, as we have seen in the 
previous chapter, a general criterion for separating maximum from 
minimum values is not readily obtainable. We must content our- 
selves with the fact that, in simple practical cases, we can usually 
tell from generak reasoning whether we have a maximum or a 
minimum value of u. 


20.5 Special cases of Euler’s equation. 

Two particularly simple cases of the problem of the previous 
section often arise in practice : 


( 1 ) 


To find extreme values oiu — 



dt, where the function 


to be integrated does not involve x explicitly. 


Writing and noting that ^ = 0, Euler’s equation becomes 


df 


dt 


dx 


i 



= 0 , 


giving —,f(t,x')=a, 

J ^ 

where a is some constant. This is a differential equation mvolvlng 

s M.A. 
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oiily the first derivative x\ i.e. a simple diflferential equation of the 
first order which can be solved, in most cases, without difficulty. 


(2) To find extreme values of where the function 

to be iutegrated does not contain i explicitly. 
dx 

Writing x' = ^ , differentiating / {x, x') as a function of two vari- 
ables each depending on t, and using Euler’s equation =-t ( ^/) 

dx at \dx J i 


d[_^dx ,dl^\ ^^_d( ,^\ 

dt~dxdt '^dx' dt~^ dt Kdx'J^dx' dt ~di T dx') ' 

Hence, / and x' ~ , having equal derivatives with respect to t, can 

only differ by an arbitrary constant, i.e. 

f(x, x’)=x'~f{x, x’)+a 


where a is some constant. Again we have a differential equation of 
the first order, involving only x and its derivative x\ The solution, 
giving the function x^<f){t) we require, proceeds with little difficulty. 
It is to be noticed that the two problems of 20.2 above are both of 
this special form. 


20.6 Examples of solution by Euler’s equation. 

The methods of solution developed here can be illustrated by the 
following three examples : 

/ dx\^ 

Ex. 1. To find extreme values of ti=J ^ subject to the 

boundary conditions that the curve x==^<f>{t) passes through two fixed 
points A {Iq, Xq) and 

We have the special case (1) of 20.6 with " 

d 


f{t, x') -= tx'^ and ^ / (t, x') = 2tx\ 


Hence, 


2tx' ■= constant, i.e. 


dx a 
dt^t 


where a is some constant. The integral is obtained at once as 

a: « J I di + constant, i.e. a; = a log t+b 

where 6 is a second constant. THe extreme value of uia thus given by a 
logarithmic curve. 
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The appropriate values of the constants a and b are to be found by utllng 
the boundary conditions. Since the curve must pass through the fixed 
points A and J5, we have 

a:Q=alogfo+^ and x^ — a\ogt^-\-h. 

On solving these linear equations, the constants are given as 


- »i-=go ft„. =Cologti-a^ilog <0 

log - log <0 log ti - log <0 

In this way we fix that particular logarithmic curve which passes through 
A and B, The use of the boundary conditions in determining the relevant 
values of the constants of the solution is to be noticed. 

Finally, the extreme value of u determined is 


u = 



= a2(log fi-log g = 


log ^i-log 


It is not possible to indicate here whether this is a maximum or a minimum 
value. 


Ex. 2. To find the minimum value of u = 1 + ^ where 

the curve x = (/){t) is subject to the usual boundary conditions. 

We have here the special case (2) of 20.6 with 


/(x, x') =X\/l 


and /(x, x') = 


XX 


s/l 




and Euler’s equation is 


xx'* 

xn/I +x'* = “7i JZ -ha, 

vl -hx ^ 


On multiplying up, squaring and collecting terms, we find 
x^-a* . dx Jx^-a^ 


r'2=- 


i.e. 


dt' 


TT • { dx Cdt ^ ^ t-b 

Hence, constant = — 

where both a and b are constants. It is easily verified that 


d r, x + Vx^-a^^ 1 

^ I dx\ ^ a J Jx^ —a^ 

and the solution of our differential equation becomes 

, x + Vx^-a* t —b . /— ^ ^ ^ 

® a a 


S2 


M.A. 
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Nc'w, 

X - -a^)(x + Jx^ -g^) x^ - (x^ - a^) a* ae”^ 

x-\-Jx^-a^ x-\-sJx^ -a^ ae^ 

/ ti? 

Adding, a: = Ja\e«+6 ^ J 

w^hich is the function giving the minimum value of u. The boundary con- 

ditions provide the values of a and 6 in terms of {Iq, Xq) and Xi ) : 

/ to-b\ / u-b 

x^^\a\e ® +e ) and Xy^^\a\e ® +€ ^ 

The minimised value of u is then given by 

j /dx\^ dx Jx^ - 

—J.Wi+ls)'® a — 


x^ 1 r«i / 2«-&) \ 

i.e. —dt^-a\ \e « +e « -¥2] di. 

This integral can be evaluated fairly easily. 

( JjzP\ 

The curves corresponding to the function t 3 rpe x^\a\e^ + c” ® ) are 
called ** catenaries *\ The curve which gives the minimum value of u 

is thus that catenary which passes 
through the two fixed points A and B 
given by the boundary conditions. 
The shape of this curve is shown in 
Fig. 105. The solution of the problem 
of 20.2, Ex. 1, is now obtained ; the 
catenary is the curve which gives 
a surface of revolution of smallest 
area. It is to be noticed that the 
nature of this geometrical interpreta- 
tion of the problem shows that the 
value of u we have found is a 

minimum and not a maximum. 

Ex. 3. To find the minimum value of = 
to the usual boundary conditions. 

This is again the special case (2) of 20.6 with 


'IH 


dt subject 



f(x, x') = and “ 

and Euler’s equation is 

11+^ z’* 

W ^"“ T " " """ 

^ ^ Jx(l-¥X'^) 


a 



SOME PROBLEMS IN THE CALCULUS OF VARLitTIONS « 6^ 

We find, on multiplying up, squaring and collecting terms, • 

1 -a^x . dx I ll -a^x 

— 2 — - i.e. • 

a^x dt a ^ X 

The integral of this differential equation is 

{dt ^ ^ t-b 

^ aa; = I — h constant = 

1 -a^x j a a 

where both a and b are arbitrary constants. It is now only a matter of 
evaluating the integral sho^vn. This can be done only with the aid of 
“ trigonometric ” functions and we cannot proceed further here.^ It can 
be stated, however, that the relation between x and t obtained on com- 
pleting the integration is represented by a curve of a well-known class, 
the class of ** cycloids The solution of our problem, taking account as 
before of the boundary conditions, is given by that cycloid which passes 
through the two fixed points O and A, It is this curve which gives the 
minimum time of descent from O to A (see the brachistochrone problem 
of 20.2 above). 

20.7 A dynamic problem of monopoly. 

A monopolist sells a good X on a market consisting of a com- 
petitive group of consumers. If he produces an output x per unit of 
time, the cost of production is given by n(x) and remains unchanged* 
throughout the period from t = tQ to t = ti which w© consider. It is 
assumed that the market price of X varies continuously over the 
period and is represented by some function p (t). The demand of Che 
competitive group of consumers is assumed to involve a “ specu- 
lative ” element and to be described by the demand law given in 

16.7 above : 

X = <f>{p{t), 

The market demand thus varies over the given period according to 
the course taken by the price of X. The monopolist’s profit per unit 
of time when he produces an output x per unit of time and sells it 
at the priae is given by 

xp{t)-n{x). 

This is the rate of profit at any time t and depends on the price and 

♦ The solution oT the problem can be written : 

^ = 6 + a'(^-8in ^), a; = a'(l - cos ^), 

where a' is an arbitrary constant obtained from o. The required relation 
between x and t is obtained by eliminating the parameter d. The exprqgsiona 
sin Q and cos 6 are trigonometric ratios of the angle d radians. * 
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the rate of change of the price at this time. The total profit in the 
given period from t^t^ to t = ti is obtained by the addition of the 
profits in successive units of time. Since the variation of price and 
output is assumed to be continuous, the addition can be represented 
by an integral, i.e. total profit is 

u = I i^P “ ^i^)} 

For a given course of price over time, the demand x of the market is 
determined at each moment and there is a definite value of u corre- 
sponding. Different values of u are obtained for different courses of 
price. Hence, the monopolist’s total profit depends on the form 
of the price function over time, on the course taken by price. It is 
assumed, finally, that the ruling price at the initial moment is 
given as po and that the subsequent course of price is fixed by the 
monopolist so as to maximise his total profit in the whole period 
from t^tQ to t = ti. It is required to determine the course of price, 
i.e. the form of the price function p(t), actually fixed by the mono- 
polist. The corresponding output at successive moments is then 
derived from the given demand law.* 

' As a first problem, it is taken that the choice of the monopolist 
is restricted by the fact that the final price {pi at < = is given in 
addition to the initial price. (This artificial assumption will be 
dropped when it has served its purpose.) We have then a problem 
in the calculus of variations : to find the function p {t) which 

maximises the value of {xp-n{x)}dty where x — <f>(pyp') and 

’where the boundary conditions give p=Po when < = P^Pi 

when This is the special case (2) of 20.5 with 

f{p,p')—ocp-n{x) where x = <f){p,p') 
as the function to be integrated to give u. 

* The problem is based on the work of Roos and Evans, see Evans, Maths^ 
maiical Introduction to Economics (1930), pp. 143 et aeq. Notice that, if prices 
do not change continuously but are fixed at definite intervals, then the mono- 
polist’s total profit is a fimction of a finite number of plic es (one for each of 
the intervals in which prices are fixed). The problem of maximising total 
profit is then relatively simple, a problem of the maximum value of a function 
of several variables. It appears that here, as in many problems of capital 
theoi5% the discontinuous form is both more realistic and more simple from 
the mathematical point of view. 
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9 ft M 9a; 

ipnp.p)=p^. 


dn dx 
dx dp 


Hence, 

' V dx) dp’ 

and Euler’s equation gives 

xp - n{x) =p' (p - ^ + constant. 

This is a first-order differential equation in the function p{t), 

dp 

remembering that = p*) is a given function of p and p^ = . 

at 

The solution of the equation gives the function p (t) required except 
that it involves two arbitrary constants (the constant above and a 
constant of integration). The appropriate values of the constants 
are given by the two boundary conditions fixing the initial and final 
prices. The problem is thus theoretically determinate and the 
following particular case shows how an actual solution can be 
obtained when we know the forms of the demand and cost functions. 

Suppose that the demand function is linear, x=ap-\-b^cp\ and 
the cost function quadratic, where the six co- 

efficients are all constants. The equation for p [t) is now 

p {ap -f 6 -h cp') - 77 = cp' ^p - -h constant, 

dn dp 


i.e. 


c -7- -h ap2 -h 6p - 77 = constant. 

dxdt 


Differentiating with respect to t and collecting terms, 

dmd^p dm dp / . dn\ ^ 

But; ^^=2ccc + )3 = 2ca^ + 2aap + (2ta+;3) and ^^ = 2«. 
The differential equation for p (t) then reduces to 

• 2c^a^ -2a(aa- l)p-h(6 - 2a6a ~a^) = 0, 


I.e. 


where 


d^p 


« 6 ~ 2ab(x - aB 


=XHP-P). 


and A* = 


a(a(x - 1) 


c^oc 


2a {aa - 1) 

^(A* is positive if the constants have the signs appropria^ tcT the 



636 , MATHEMATICAL ANALYSIS FOR ECONOMISTS 
r 

normal case, i.e. a <0 and a> 0). The solution of this simple second- 
order differential equation is known to be 

'p —p + 

where A and B are constants of integration. The solution can be 
checked by differentiation. The constants p and A are given in 
terms of the coefficients of the demand and cost laws. The constants 
A and B are to be found in terms of the initial and final prices by 
substituting p when < = p =pi when < = The problem is 
solved and the course of prices over time, as determined by the 
monopolist, is given uniquely. 

Returning to the general case, given the final price pi, we have 
shown that the price function p {t) and the maximised value of total 
profits u can be obtained. The total profits (as maximised) must 
be a function of p^y say u=F{p^. To complete the problem, the 
monopolist has only to fix that final price (of all possible final prices) 
which maximises as a function of p^y i.e. which gives the largest 
total profits of all. For this, we put 

du ^ 1 ^ . dhi 

= 0 subject to - 7 — r < 0 

dpi 

and determine p^ from the resulting equation. 

• 

20.8 Other problems m the calculus of variations. 

The simple problem in the calculus of variations treated above is 
capable of generalisation in a number of directions.* For example, 
the function in the integral u may involve second and higher order 
derivatives of the variable function or the variable function may 
depend, not on one variable ty but on several variables t^y t^, ... . 
Further, the extreme value of u may be required subject to a number 
of given side relations in the variable function, the problem of 
relative maximum and minimum values. But perhaps the most 
useful extension of the problem arises when the integral u depends 
on several variable functions instead of on only^one such function. 
This extension can be illustrated by the case where there are two 
variable functions, and y = 0(^), and where the integral to 


• See Courant, op, cit,, pp. 607-20. 
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• * 4 

be maximised or minimised is of the simple form (analogous to that 


of 20.4 above) : 


u 


f'l) 


dt 


subject to the boundary conditions that x and y both assume fixed 
values when 1=^1^ and 

It can be shown (following the argument of 20.4) that, when the 
parameters of the variable functions are changed so that hx and hy 
are the resulting increments in the functions themselves, then the 
variation of u is 


dy 


dx 


where x*—-^ and ^ . The necessary condition for an extreme 
value of is that = 0 for all variations ^x and Sy. Hence, 

- 1 = 1 ( 1 ) ■ 
i.e. there are two differential equations (each similar to Euler’s 
equation above) to be solved for the functions x = ^(^) and y — ip{t). 
An economic example of the present problem can be taken from* 
a theory of saving developed by Ramsey.* There are two variable 
factors of production in a community, i.e. labour B and capital C, 
each of which is taken as homogeneous in nature. If amounts b and c 
of the factors are used, the total product (or income) of the com- 
munity is given by the production function : product =/(6, c). The 
variations of labour and capital over time are assumed to be con- 
tinuous and represented by the functions b(t) and c(0. The rate of 

dc 

growth of capital, or the saving of the community, is ^ at any time. 

The consumption (or expenditure) of the community is then 

do 


a{t)=f(b, c)-^. 


•( 1 ) 


It is assumed that the utility of the community’s consumption is 
measurable and tiependent only on the amount of the consumption, 
w = ^(a). Similarly, the disutility (assumed measurable) of labour 


♦ See Ramsey, A Mathematical Theory of Saving, Economic Journal,^! 92a 
A modified version of this theory is given here. • 
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t 

is t^aken as v=^(&). Over a period of time {t^, the total net 
utility of the community is 

where future utilities are not discounted. This quantity C7 is a 
functional of the variable functions b{t) and c{t), representing the 
changes in the employment of labour and capital over time. The 
function a{t) is expressed in terms of b{t) and c{t) by (1) above. It 
is assumed that the amount of labour and capital employed at the 
initial time (< = <o) given and that the “ optimum ** amounts to 

be aimed at (^=^) are also known. Subject to these boundary con- 
ditions, the community is taken as fixing the amount of work it does 
and the amount of saving it makes over time to maximise total 
utility. We have, therefore, to find the functions b(t) and c(t) 
which maximise the value of U, subject to the usual boundary 
conditions. 

Write c'=^ and 
ctt 

17 = r F(b, c, c’)dt, 

where F (6, c, c') = ^{a)-t// (6) and a =f {b, c) - c'. 

The conditions for the maximum of U are 


dF /dF\ ^ 


and 


dJP_d /dF\ 
dc dt \9c'/ * 


The first condition gives 


i.e. 


g'=f(a)|-f(6)=0. 
df f(6) 


db <f>'{a) 

The second condition gives 

.df_dF _d (dF\ d, 

^ * dc dc dt \dc') dt^ ^ ^ 

df 1 d 


.( 2 ) 


i.e. 


dc dt 






.(3) 


The functions a{t), b{t) and c{t) are to be found from the differential 
equations* (1), (2) and (3), the boundary conditions being used to 
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evaluate the arbitrary constants that appear on integration. Xhe 
solution of the problem is determinate in general. 

rif fif 

The partial derivatives ~ and ~ of the production function 

measure the marginal products of labour and capital at any time and 
can thus be associated with wages and the rate of interest respec- 
tively. The rate of wages, by (2), equals the ratio of the marginal 
disutility of labour to the marginal utility of consumption. The rate 
of interest, by (3), is equal to the proportional rate of decrease of the 
marginal utility of consumption over time. Assuming that the rate 
of interest never becomes negative, the marginal utility of con- 
sumption falls over time either until it vanishes or until the rate of 
interest becomes zero. Further, if <f>'(a) is a decreasing function of a 
(decreasing marginal utility), then consumption rises over time until 
its marginal utility or the rate of interest is zero. 

An expression for the rate of saving can be found as follows. 
Using the results (1), (2) and (3), we have 

I (^■(a)/(6, c)) = f '(.) pit. c) + fla) (I § + IJ') 

= m c) + f (6) § - 1 {^'(a)}{/(6. c)-a) . 

= ^"(a) ^/(6. c) + m ^ J {/(6, c) - a} ^ 


So c) =|a ^ dt + J^'(6) ^ + constant 

c=a<f)'{a) + constant ' 

=a^'(a) - {<f> (a) - ift (6)} + constant, 
i.e. * (a) {/(6, c) - a} = constant - {<f> (a) -tj) (6)}. 


• Since ^ {af(a)) = a^ {f (a)} + . we have 

f ^ 

a^'(o) = + 1 <l>'{a) ^ 

jo ^ {f (o)}* = - |f («) ^ *• 
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Aqnce, using (1)‘ we find 

^ dc A- 

dt~' <j>\a) ^ ^ ^ 

where ^ is a constant. 

Since 17 is to be a maximum (as opposed to a minimum), the 
excess of utility over disutility, ^(a) - 0(6), must increase over time. 
Further, it can be taken that capital increases at a decreasing rate 
until it becomes stationary at the “ optimum ** at ^ = So, from (4), 
the excess of utihty over disutility increases and tends to its maxi- 
mum value A at Then, from (4), the rate of saving multiplied 

by the marginal utility of consumption at any time equals the 
amount by wliich the excess of utihty over disutihty falls short of 
the optimum amount A. As time goes on, consumption increases, 
the marginal utihty of consumption decreases and the rate of saving 
decreases until it becomes zero. 


EXAMPLES XX 

Problems in the calculvs of variations 

1, Find the curve x—<f>(t) which corresponds to an extreme value of 
cti /dx\ * 

t^{—\dt and passes through two fixed points at t — t^ and t — t^, 
^ J^o \dt/ fdx\* 

* Generalise by considering ti = 1 t,^\-j:)dt, where n is a fixed number, on 
similar lines. ^ 

fdx\* 

< 0 ^* \5^/ extreme value when the function 

X — <f)(t) is of the form x — a^t- h. How do the boundary conditions that the 
curve passes through fixed points at ^ = and t=^t^ determine the constants 
a and h 7 

3. Show that an exponential curve gives the maximum or minimum 

I tx 1 fdx\^ 

t« \^/ subject to the conditions that the curve passes 
through fixed points at and ^ = 

4. If u = ^ 1 1 d- I where fixed values of x correspond to 

t=to and t = t^, find the function x=<l>(t) which gives u its exti^eme values. 
Show that the corresponding curve is an arc of a parabola. 

5. Light, travelling in a plane Oxy, follows a path t/= ^(x) which is deter- 
mined (by Fermat’s principle) so that the time of transit^ is a mini mum. Th is 

is equivalent to finding the function <f> which minimises u 

where v is a function of x' and y denoting the velocity of light at various 
points. Assuming that v is a constant, show that the path of light between 
two tfxed points is a straight line. 
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6. The velocity of light, in the problem of the previous example, increase 
in proportion to the height above the zero horizontal line Ox, Vxitv^ky 
and show that the path of light between two fixed points is an arc 6f a circle. 


7. If f(x) is a given function and w = 




dty where x has 


fixed values at f = fo and t = show that an extreme value of m is given by the 
function x — defined by 


J ^a{f(x)}^-l 

where a and 6 are constants to be determined by the boundary conditions. 
Show that the previous three examples are particular cases of this general 
result. 


8. A radio manufacturer (see Examples V, 16, and Examples VIII, 33) 
produces x sets per week at a total cost of £17, where 77=^^* + 3a; + 100. 

The demand of the market is 76 - 3p + 276 ^ sets per week at any time 

when the price is £p per set. The manufacturer is a monopolist fixing the 
course of price over time so that his total profit is a maximum. The initial 
price (^ = 0) is £16 per sot and it is required to obtain a price of £20 per set 
after 100 weeks (t=100). Show that, at any time t weeks after the initial 
week, the price per set is 

p= 16-179 + 0171«®»"< + 0-660e“®«»<. 

Take all figures correct to three decimal places and logi^e = 0-4343. 


9. In the problem of the previous example, show that the price falls to a 
ooinimum after about 20 weeks. What is this minimum? Draw a graph to 
illustrate the course of price over time. 


10. In the solution of the problem of 20.7, show that 

^_(Pi-p)e-^-(Po-p)e->*> (Pi -p)e^ * 

« 

11. Under the conditions of the previous example, show that the price p 
falls in the period {tot t^) from po to & minimum and then rises to p^. Verify 
that the minimum price occurs when 


1 1 B 


12. In the problem in the theory of saving analysed in 20.8, the production 
function f{b, c) is linear and homogeneous. Show that 

- 60'(6)=a^'{a) + |{c^'(a)) 


and interpret^this result. 


Show, also, that 
ldc_df o^'(a) - 
c dt 


bf(b) 


dc c<f>'(a) 

What is the rate of interest at the optimum time (< = when capital ceases 
to accumulate! ^ ^ 
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Infinite derivatives, 147-^ 
discontinuities, 101, 148, 188 
limits, 86-98 

properties of numbers and spaoo, 
6-7, 16-7 * • 

series, 446-9 

Inflexional values, 181, 182, 191-5, 
460-1 

Integrals and integration, 384-99 
and areas, 384, 387-90 
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Integrals and integration as inverse 
differentiation, 384, 390-3 
in the calculus of variations, 623 
of differential equations, 390-1, 
393, 412--34 
standard forms, 394 
Interval, 8 

Inverse functions, 29, 171-2 
Irrational numbers, 4-5, 12, 86 

Jacobians, 474, 476, 490 

Lagrange's multipliers, 366, 379 
Leibniz’s Theorem, 173 
Light, path of, 540-1 
Limits, of functions, 88-98, 105, 269 
of sequences, 85-8, 446 
Linear differential equations, 417-26 
equations, 64, 60, 483-6, 493 
functions, 42, 67-8, 76-7, 228, 269 
homogeneous functions, 316-6, 317- 
9, 481-2 

Logarithmic derivation and dif- 
ferentiation, 246-8, 300, 331 
functions, 217-9, 228, 234, 237, 242-6 
graphs, 222, 225-6, 228, 262 
scales, 219-28 
series, 466 

Logarithms, 213-7, 237 
Logic and mathematics, 1-2 
Logistic curve, 265, 419 

Magnitudes, 10, 13-4, 276-8 
Maximum and minimum values, 1 80- 
91, 206, 352-9, 364-9, 469-61, 
496-602, 623-4, 626-33, 636-7 
Mean Value Theorem, 452 
Minors of determinants, 478-80 
Monotonic fimctions, 40, 269 
Multi-valued functions, 38-41, 102-3, 
206, 269, 334-9, 466-8 

Natufal exponential and logarithmic 
functions, 234-7 
scales, 21t.®“i27-8 

Necessary anS sufficient conditions, 
183, 186, 193, 204-6, 356, 366-6, 
461, 497, 499 
Neighbourhood, 8 
Normal curve of erpd*, 240 
Number e, 230-1, 466-6 
Numbers, 3-7, 85-7 

Ordinate, 21 

-jAOrigin of co-ordinates, 20-1 


Orthogonal curves and surfaces, 459- 


Parabola, 33, 69-72, 76-8 
tangents to, 145 
Paraboloid, 272, 273, 308, 422 
Parametric constants, 43-6, 46-8, 76- 
80, 625-6 

Partial derivatives (see derivatives) 
derived functions, 298 
differential equations, 433-4 
elasticities of functions, 300, 305, 
421 

stationary values, 361-2 
Pearson’s curve system, 240 
Pencil of lines, 67 
Plane, 272, 292, 305-7 
Planes, co-ordinate, 22 
Plano sections of surfaces, 272-5 
Power function, 161-3, 211, 218-9, 
228 

series, 460 

Quadrants of a plane, 18 
Quadratic equations, 6-6, 7, 30, 51 
forms, 485-92 

functions, 42-3, 71-2, 76-7, 269-70 
homogeneous functions, 316 
Qualities, 10 
Quantities, 10-6 

Radius vector, 190, 191 
Rates of change, 48, 134-7, 142, 149- 
60, 179, 246, 261, 303, 306, 310, 
326 

Rational numbers, 4, 11-2, 86 
Real numbers, 6 

Rectangular hyperbola, 33, 72-6, 76- 
9, 103 

Relative maximum and minimum 
values, 364-9, 498-600, 601-2 
Ridge and trough lines, 288, 361-2, 
357-9 

Roots of an equation, 61 
Ruled surfaces, 316 

Saddle points, 363, 496 
Scatter diagram, 26, 36, 222-3, 220 
Semi -logarithmic graphs, 221, 223-5, 
228, 237, 246 

Sequences of numbers, 85-8, 446 
Shortest distance of point from hne 
and plane, 206, 380, 600-1 
Side relations, 364, 498 
Sigma notation, 385 ^ 
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Si{apson*8 rule for integration, 399 
Single-valued functions, 38-41, 269 
Smoothm>ss of curves, 100, 147 
Space, location of points in, 16-23 
Speed, 15, 160-1, 152 
Sphere, 292, 368, 422, 430, 601 
Standard deviation, 406-7 
Stationary values, 180, 194, 363 
Step-functions, 31, 34, 91, 102 
Straight line, 63-9, 76-7 
Surface classes and systems, 272, 416, 
423-30 

Surfaces and fimctions, 270-6 
of revolution, 292, 623-4, 632 
Symmetrical fimctions, 41, 269 


Tangent planes to suHaces, 307-9, 
340, 416, 423 

Tangents to curves, 143-8, 338-9, 413- 
4, 423, 427 

Taylor’s series, 179, 463, 468 
Time series, 36, 44, 221, 223-6, 382-S 
Total differential equation, 421-2 
Trapezoidal rule for integration, 398 

Units, change of, 13-4, 16-6, 139 

Variables, 8-9, 29-32, 43-6, 46-8, 98-9 
Velocity, 16, 160-1, 162 

Young’s Theorem, 301-2, 309 


ECONOMIC APPLICATIONS 


Average cost, 120-1, 166-6, 261-3, 
401, 605-6 

product, 133, 178, 267, 312-3, 321-2 
revenue, 117, 163-4, 267-60, 400-1 

Bilateral monopoly, 381 

Capital and interest, 234, 248-60, 
362-4, 403, 404-6 
and saving, 637-40 
values, 233, 401-3, 404-6 

Coefficient of relative cost of produc- 
• tion and of relative efficiency of 
organisation, 261 

Competitive and complementary 
goods and factors, 282, 311, 361- 
2, 609, 613, 619 

Compound interest problems, 31, 
228-34, 235-7, 248-60, 362-4, 
376-8, 401-3, 404-6, 418, 419 

Conjectural variations, 203, 346-7 

Constant outlay curves, 256-7 

product curves, 286, 320, 341-3, 
370-1 

returns to scale, 320, 363, 371-2, 
606 

Consumers’ choice, theory of, 124-9, 
289-91, 374-8, 438-42, 609-17, 
520 

Cost functions, curves and surfaces, 
117-9, 166-6, 262, 283, 370-2, 
606, 633-6 

average and marginal, 120-1, 166-6, 
261 - 3 ? 371-2, 401, 444, 605-6, 622 


Cost functions, elasticity of, 260-3 
normal conditions of, 119, 261-3 
particular forms of, 120, 166-6, 199, 
262-3, 636 
shifting of, 118, 120 

Demand and offer curves, 121-2 
Demand functions, curves and sur- 
faces (for consumers’ goods), 28- 
9, 108-10, 121, 281-3, 374-6, 434, 
609-17 

continuity of, 110-2 
dynamic forms of, 435-8, 633-6 
elasticity of, 117, 265-60, 266, 310- 
2, 376, 612-3, 620 
normal conditions of, 112, 267-9 
particular forms of, 112-6, 117, 154, 
199, 269, 282, 311-2, 435-8, 635 
shifting of, 110, 116-6, 282, 283 
stability of, 376, 610-2, 614-7 
Demand functions (for factors of 
production), 369-74, 484-6, 602-9, 

619 

elasticities of, 373-4, ^i/8-9, 519 
stability of, 370, 602-ir 
Demand functions (for loans), 376-8. 

620 

stability of, 377 

Discount curves! 936-7, 260, 293 
Disutility of labour, 637-8 
Duopoly problems, 200-4, 346-7 
Durable capital goods, 404-6 


Elasticities of cost, 260-3 
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Elasticities of demand (consumers’ 
goods), 117, 255-60, 266, 310-2, 
375, 512-3, 520 

of demand (factors of production), 
373-4, 508-9, 519 
of productivity, 263, 350 
of substitution, 341-3, 344-6, 372-4, 
604-5, 612-3, 520 
of supply, 267 

Factors of production, 263, 284-9, 
38U2, 537 

demand for, 369-74, 602-9, 519 
relative shares of, 364 

Increasing and decreasing returns, 
261, 263 

Indifference curves and surfaces (for 
consiuners’ goods), 124-7, 166, 
289-91, 344, 376, 439, 441-2, 611 
(for income over time), 127-9, 157, 
291, 344, 377-8 

Indifference direction, 439 
plane, 440, 513 

Investment, amount of, 228-32, 233, 
235-6, 402-3 
opportunity line, 124 

Joint production, 350, 359-62, 518 

Linear homogeneous production fimc- 
tion, 320-2, 343, 371-4, 434, 605- 
9, 541 

Loss leaders, 362 

Marginal cost, 165-6, 261-3, 401, 606- 
6, 522 

disutility of labour, 639 
increment of revenue, 153 
prpduct, 178, 267, 312-3, 321-2, 539 
productivity law, 372, 381-2, 434, 
506 

rafe of return over cost, 156, 159, 
234, 345, 376-8 

rate of 5!Sl|titution, 156, 286, 341- 
3, 344^' 372-4, 439-41, 511-2, 
513-7 

rate of time preference, 157, 344, 
377-8 

revenue, 16, 15J-I, 257-60, 400-1, 
622 

supply price, 156 
utility, 265, 314, 344, 637 
utility 6f money, 612 
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Mathematiccdunethods in econdhuc 
107-8 ^ 

Monopoly problems, 196-2^, 369-64, 
381, 518, 519, 522, 53^6 

National Savings Certificates, 230, 
240 

Pareto’s income law, 222, 228, 407-8 
Period of production, 262-4 
Preference directions, 442 
lines, 442 

scales, 124, 290, 438-42 
Present or disco\inted values, 232-^ 
236, 249-60, 362-6, 376, 401-3, 
404-6, 448-9 

Price, as average revenue, 117, 163 
as a discontinuous variable, 9, 111, 
534 

variable over time, 434-8, 533-6 
Production, capital and interest, 362- 
4, 403 

Production fimctions and surfaces, 
133, 178, 263, 267, 284-9, 312-3, 
362, 537 

linear and homogeneous, 320-2, 
343, 371-4, 434, 505-9, 541 
normal conditions of, 286-9, 313, 
320-2 

particular forms of, 288-9, 313, , 
322, 343, 363, 403 
Production indifference curve, 123 

Rate of interest as marginal prodiiot 
of capital, 639 

Reaction curves, 201-4, 345-7 
Revenue, total, average and mar- 
ginal, 15, 116-7, 153-4, 267-60, 
400-1, 622 

Saving, rate of, 540 
theory of, 537-40 

Spatial distribution of consumers, 
80-2 

Speculative demand and supply, 434- 
7, 633-6 

Stability of demand, 370, 376, 377, 
602-3, 610-2, 614-7 
of price over time, 436 
of supply, 520 

Substitution, 340-6, 372-4, 439-41, 
503-5, 509, 612-3, 517, 520 
Substitutional and limitational fac- 
tors, 381-2 
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functional^ 622 
yt^upply functions and curves, 121, 
' ^ 1S2‘;209, 434, 520, 622 

dynamic forms of, 435-8 
elasticity of, 267 
stability of, 520 


Taxation, 60, 208 
Time-lag in production, 436-8 


Transformation functions and curves, 
122-4, 166, 234, 284, 346, 37G-8, 
520 

Utility functions, 126, 289-91, 302, 
313-4, 439, 441, 471, 637 
marginal, 265, 314, 344, 637 
particular forms of, 127, 291, 314 

Wages as marginal product of labour* 
539 
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